MAMMALIAN TOXICOLOGICAL RESPONSE MARKERS

This application is filed under the Patent Cooperation Treaty and claims the benefit of U.S.
Nonprovisional Application No. 09/443,184, our Docket No. PC-0007 US, filed 19th November, 1999.

TECHNICAL FIELD 

The present invention relates to mammalian nucleic acid and protein molecules, and methods for 
their use in diagnostic and therapeutic applications including detecting metabolic and toxicological 
responses, and in monitoring drug mechanism of action. 

BACKGROUND ART 

Toxicity testing is a mandatory and time-consuming part of drug development programs in the
pharmaceutical industry. Gene expression microarrays may provide a more rapid screen for determining
the effects of lead drug candidates upon metabolism and for detecting their toxicity. For example,
microarrays of various kinds may be produced using full length genes or gene fragments. These arrays
can then be used to test samples treated with the drug candidates to elucidate the gene expression pattern
associated with drug treatment. This gene pattern can be compared with gene expression patterns
associated with compounds which produce known metabolic and toxicological responses.

Benzo(a)pyrene is a known rodent and likely human carcinogen and is the prototype of a class of
compounds, the polycyclic aromatic hydrocarbons (PAH). It is metabolized by several forms of
cytochrome P450 (P450 isozymes) and associated enzymes to form both activated and detoxified
metabolites. The ultimate metabolites are the bay-region diol epoxide,
benzo(a)pyrene-7,8-diol-9,10-epoxide (BPDE) and the K-region diol epoxide, 9-hydroxy
benzo(a)pyrene-4,5-oxide, both of which induce formation of DNA adducts. DNA adducts have been
shown to persist in rat liver up to 56 days following treatment with benzo(a)pyrene at a dose of
10 mg/kg body weight three times per week for two weeks (Qu and Stacey (1996) Carcinogenesis
17:53-59).

Acetaminophen is a widely-used analgesic. It is metabolized by specific cytochrome P450
isozymes with the majority of the drug undergoing detoxification by glucuronic acid, sulfate and
glutathione conjugation pathways. However, at supratherapeutic doses, acetaminophen is metabolized to
an active intermediate, N-acetyl-p-benzoquinone imine (NAPQI), which can cause hepatic and renal
failure. NAPQI binds to sulfhydryl groups of proteins, causing their inactivation and leading to
subsequent cell death (Kroger et al. (1997) Gen. Pharmacol. 28:257-263).

Clofibrate is a hypolipidemic drug which lowers elevated levels of serum triglycerides. In
rodents, chronic treatment produces hepatomegaly and an increase in hepatic peroxisomes (peroxisome
proliferation). Peroxisome proliferators (PPs) are a class of drugs which activate the PP-activated
receptor in rodent liver, leading to enzyme induction, stimulation of S-phase, and a suppression of
apoptosis (Hasmall and Roberts (1999) Pharmacol. Ther. 82:63-70). PPs include the fibrate class of
hypolipidemic drugs, phenobarbitone, thiazolidinediones, certain non-steroidal anti-inflammatory drugs,
and naturally-occurring fatty acid-derived molecules (Gelman et al. (1999) Cell. Mol. Life Sci.
55:932-943). Clofibrate has been shown to increase levels of cytochrome P450 4A. It is also involved in
transcription of β-oxidation genes as well as induction of PP-activated receptors (Kawashima et al.
(1997) Arch. Biochem. Biophys. 347:148-154). Peroxisome proliferation that is induced by both
clofibrate and the chemically-related compound fenofibrate is mediated by a common inhibitory effect on
mitochondrial membrane depolarization (Zhou and Wallace (1999) Toxicol. Sci. 48:82-89).

Toxicological effects in the liver are also induced by other compounds. These include
carbon tetrachloride (a necrotic agent), hydrazine (a steatotic agent), α-naphthylisothiocyanate (a
cholestatic agent), 4-acetylaminofluorene (a liver mitogen), and their corresponding metabolites, which
are used in experimental protocols to measure toxicological responses (Waterfield et al. (1993) Arch.
Toxicol. 67:244-254).

The present invention provides mammalian nucleic acid and protein molecules, their use in 
diagnostic and therapeutic applications including detecting metabolic and toxicological responses, and in 
monitoring drug mechanism of action. 

DISCLOSURE OF INVENTION

The invention provides a method for detecting or diagnosing the effect of a test compound or
molecule associated with increased or decreased levels of nucleic acid molecules in a mammalian subject.
The method comprises treating a mammalian subject with a known toxic compound or molecule which
elicits a toxicological response, measuring levels of a plurality of nucleic acid molecules, and selecting
from the plurality of nucleic acid molecules those nucleic acid molecules that have levels modulated in
samples treated with known toxic compounds or molecules when compared with untreated samples.
Some of the levels may be upregulated by a toxic compound or molecule, others may be downregulated
by a toxic compound or molecule, and still others may be upregulated with one known toxic compound or
molecule and be downregulated with another known toxic compound or molecule. The selected nucleic
acid molecules which are upregulated and downregulated by a known toxic compound or molecule are
arrayed upon a substrate. The method further comprises measuring levels of nucleic acid molecules in
the sample after the sample is treated with the toxic compound or molecule. Levels of nucleic acid
molecules in a sample so treated are then compared with the plurality of the arrayed nucleic acid
molecules to identify which sample nucleic acid molecules are upregulated and downregulated by the test
compound or molecule. In one embodiment, the nucleic acid molecules are hybridizable array elements
of a microarray.

Preferably, the comparing comprises contacting the arrayed nucleic acid molecules with the
sample nucleic acid molecules under conditions effective to form hybridization complexes between the
arrayed nucleic acid molecules and the sample nucleic acid molecules; and detecting the presence or
absence of the hybridization complexes. In this context, similarity may mean that at least 1, preferably at
least 5, more preferably at least 10, of the upregulated arrayed nucleic acid molecules form hybridization
complexes with the sample nucleic acid molecules at least once during a time course to a greater extent
than would the probes derived from a sample not treated with the test compound or molecule or a known
toxic compound or molecule. Similarity may also mean that at least 1, preferably at least 5, more
preferably at least 10, of the downregulated arrayed nucleic acid molecules form hybridization complexes
with the sample nucleic acid molecules at least once during a time course to a lesser extent than would
the sample nucleic acid molecules of a sample not treated with the test compound or a known toxic
compound. In one aspect, the arrayed nucleic acid molecules comprise SEQ ID NOs:1-47 or fragments
thereof.
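
The similarity criterion described above can be illustrated with a minimal Python sketch. The function names `count_responders` and `is_similar`, the element names, and the signal values are hypothetical and serve only as an illustration; the thresholds of 1, 5, and 10 elements and the "at least once during a time course" condition are taken from the text.

```python
def count_responders(time_course, baseline, direction="up"):
    """Count array elements whose signal departs from baseline at least once.

    time_course: element name -> list of signals, one per time point (treated sample)
    baseline:    element name -> signal from the untreated sample
    direction:   "up" counts signals above baseline, "down" counts signals below it
    """
    if direction not in ("up", "down"):
        raise ValueError("direction must be 'up' or 'down'")
    count = 0
    for name, signals in time_course.items():
        reference = baseline[name]
        if direction == "up":
            responded = any(s > reference for s in signals)
        else:
            responded = any(s < reference for s in signals)
        if responded:
            count += 1
    return count


def is_similar(time_course, baseline, direction="up", min_elements=1):
    """Apply the 'at least 1 / 5 / 10 elements' criterion (hypothetical helper)."""
    return count_responders(time_course, baseline, direction) >= min_elements


# Illustrative (made-up) signals for three array elements.
baseline = {"elem_1": 100.0, "elem_2": 100.0, "elem_3": 100.0}
treated = {
    "elem_1": [90.0, 150.0, 95.0],    # exceeds baseline at the second time point
    "elem_2": [80.0, 85.0, 99.0],     # never exceeds baseline
    "elem_3": [120.0, 130.0, 140.0],  # exceeds baseline throughout
}
```

Raising `min_elements` from 1 to 5 or 10 corresponds to the preferred and more preferred levels of stringency.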

Preferred toxic compounds are selected from the group consisting of hypolipidemic drugs,
n-alkylcarboxylic acids, n-alkylcarboxylic acid precursors, azole antifungal compounds, leukotriene D4
antagonists, herbicides, pesticides, phthalate esters, phenyl acetate, dehydroepiandrosterone (DHEA),
oleic acid, methanol and their corresponding metabolites, acetaminophen and its corresponding
metabolites, benzo(a)pyrene, 3-methylcholanthrene, benz(a)anthracene, 7,12-dimethylbenz(a)anthracene,
their corresponding metabolites, and the like, carbon tetrachloride, hydrazine, α-naphthylisothiocyanate,
4-acetylaminofluorene, and their corresponding metabolites. Preferred tissues are selected from the
group consisting of liver, kidney, brain, spleen, pancreas and lung.

The arrayed nucleic acid molecules comprise fragments of messenger RNA transcripts of genes
that are upregulated or downregulated at least 2-fold, preferably at least 2.5-fold, more preferably at least
3-fold, in tissues treated with known toxic compounds when compared with untreated tissues. Preferred
arrayed nucleic acid molecules are selected from the group consisting of SEQ ID NOs:1-47 or fragments
thereof, some of whose expression is upregulated following treatment with a toxic compound or molecule
and others of whose expression is downregulated following treatment with a toxic compound or
molecule. More preferable are SEQ ID NOs:2, 4, 6, 8, 9, and 11 which are upregulated following
treatment with a toxic compound or molecule, and SEQ ID NOs:1, 4, and 7 which are downregulated
following treatment with a toxic compound or molecule.
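
As an illustration of the fold-change criterion, the following Python sketch sorts array elements into upregulated and downregulated groups. The function name `classify_elements`, the element names, and the signal values are assumptions made for this example; only the 2-fold, 2.5-fold, and 3-fold thresholds come from the text.

```python
def classify_elements(treated, untreated, min_fold=2.0):
    """Split array elements into upregulated and downregulated lists.

    treated / untreated map an element name to a positive signal level.
    An element is selected when its level changes by at least min_fold
    in either direction relative to the untreated sample.
    """
    up, down = [], []
    for name, treated_level in treated.items():
        ratio = treated_level / untreated[name]
        if ratio >= min_fold:
            up.append(name)
        elif ratio <= 1.0 / min_fold:
            down.append(name)
    return sorted(up), sorted(down)


# Illustrative (made-up) signal levels.
treated_levels = {"elem_a": 900.0, "elem_b": 100.0, "elem_c": 250.0}
untreated_levels = {"elem_a": 300.0, "elem_b": 400.0, "elem_c": 200.0}
up, down = classify_elements(treated_levels, untreated_levels, min_fold=2.0)
```

Passing `min_fold=2.5` or `min_fold=3.0` applies the preferred and more preferred thresholds.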

The invention also provides a method comprising measuring levels of nucleic acid molecules in a
sample after the sample is treated with a test compound or molecule. Levels of nucleic acid molecules in
a sample so treated are then compared with the plurality of the arrayed nucleic acid molecules to identify
which sample nucleic acid molecules are upregulated and downregulated by the test compound or
molecule. In one embodiment, the nucleic acid molecules are hybridizable array elements of a
microarray.

Alternatively, the invention provides methods for screening a sample for a metabolic response to
a test compound or molecule.

Alternatively, the invention provides methods for screening a test compound or molecule for a 
previously unknown metabolic response. 

In another aspect, the invention provides methods for preventing a toxicological response by
administering complementary nucleotide molecules against one or more selected upregulated nucleic acid
molecules or a ribozyme that specifically cleaves such molecules. Alternatively, a toxicological response
may be prevented by administering sense nucleotide molecules for one or more selected downregulated
nucleic acid molecules.

In yet another aspect, the invention provides methods for preventing a toxicological response by
administering an agonist which initiates transcription of a gene comprising a downregulated nucleic acid
molecule of the invention. Alternatively, a toxicological response may be prevented by administering an
antagonist which prevents transcription of a gene comprising an upregulated nucleic acid molecule of the
invention.

In another aspect, the invention provides nucleic acid molecules whose transcript levels are
modulated in a sample during a metabolic response to a toxic compound or molecule. The invention also
provides nucleic acid molecules whose transcript levels are upregulated in a sample during a metabolic
response to a toxic compound or molecule. The invention also provides nucleic acid molecules whose
transcript levels are downregulated in a sample during a metabolic response to a toxic compound or
molecule. Upregulation or downregulation is at least 2-fold, more preferably at least 2.5-fold, even more
preferably at least 3-fold. The metabolic response to a toxic compound or molecule may be a
toxicological response. The invention also provides mammalian nucleic acid molecules which are
homologous to the upregulated and downregulated nucleic acid molecules. In one aspect, preferred
arrayed nucleic acid molecules are selected from the group consisting of SEQ ID NOs:1-47, or fragments
thereof.

The invention also provides a method for using a molecule selected from SEQ ID NOs:1-59 or a
portion thereof to screen a library of molecules to identify at least one ligand which specifically binds the
selected molecule, the method comprising combining the selected molecule with the library of molecules
under conditions allowing specific binding, and detecting specific binding, thereby identifying a ligand
which specifically binds the selected molecule.

Such libraries include DNA and RNA molecules, peptides, peptide nucleic acids, agonists,
antagonists, antibodies, immunoglobulins, drug compounds, pharmaceutical agents, and other ligands. In
one aspect, the ligand identified using the method modulates the activity of the selected molecule. In an
analogous method, the selected molecule or a portion thereof is used to purify a ligand. The method
involves combining the selected molecule or a portion thereof with a sample under conditions to allow
specific binding, detecting specific binding between the selected molecule and ligand, recovering the
bound selected molecule, and separating the selected molecule from the ligand to obtain purified ligand.
The invention further provides a method for using at least a portion of the proteins encoded by SEQ ID
NOs:1-47 and the proteins of SEQ ID NOs:48-59 to produce antibodies.

The invention further provides a method for inserting a marker gene into the genomic DNA of an
animal to disrupt the expression of the natural nucleic acid molecule. The invention also provides a
method for using the nucleic acid molecule to produce an animal model system, the method comprising
constructing a vector containing the nucleic acid molecule; introducing the vector into a totipotent
embryonic stem cell; selecting an embryonic stem cell with the vector integrated into genomic DNA;
microinjecting the selected cell into a blastocyst, thereby forming a chimeric blastocyst; transferring the
chimeric blastocyst into a pseudopregnant dam, wherein the dam gives birth to a chimeric animal
containing at least one additional copy of the nucleic acid molecule in its germ line; and breeding the
chimeric animal to generate a homozygous animal model system.

The invention also provides a substantially purified mammalian protein or a portion thereof. The
invention further provides isolated and purified proteins encoded by the nucleic acid molecules of SEQ
ID NOs:1-11, 17-33, 36, 39, and 41. The invention further provides isolated and purified protein
molecules of SEQ ID NOs:50 and 53. Additionally, the invention provides a pharmaceutical composition
comprising a substantially purified mammalian protein or a portion thereof in conjunction with a
pharmaceutical carrier.

The invention further provides an isolated and purified mammalian nucleic acid molecule variant
having at least 70% nucleic acid sequence identity to the mammalian nucleic acid molecule selected from
SEQ ID NOs:1-47 and fragments thereof. The invention also provides an isolated and purified nucleic
acid molecule having a sequence which is complementary to the mammalian nucleic acid molecule
comprising a nucleic acid molecule selected from SEQ ID NOs:1-47 and fragments thereof.

The invention further provides an expression vector containing at least a fragment of the
mammalian nucleic acid molecule selected from the group consisting of SEQ ID NOs:1-47. In another
aspect, the expression vector is contained within a host cell.

The invention also provides a method for producing a mammalian protein, the method
comprising the steps of: (a) culturing the host cell containing an expression vector containing a
mammalian nucleic acid molecule of the invention under conditions suitable for the expression of the
polypeptide; and (b) recovering the polypeptide from the host cell culture.

The invention also provides a pharmaceutical composition comprising a substantially purified
mammalian protein encoded by SEQ ID NOs:1-11, 17-33, 36, 39, and 41 and the amino acid sequence of
SEQ ID NOs:50 and 53 and fragments thereof, in conjunction with a suitable pharmaceutical carrier.

The invention further includes an isolated and purified antibody which binds to a mammalian
protein encoded by SEQ ID NOs:1-11, 17-33, 36, 39, and 41 and mammalian protein of SEQ ID NOs:50
and 53 or fragments thereof. The invention also provides a purified agonist and a purified antagonist.

BRIEF DESCRIPTION OF THE SEQUENCE LISTING

A portion of the disclosure of this patent document contains material which is subject to
copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of
the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file
or records, but otherwise reserves all copyright rights whatsoever.

The Sequence Listing contains the nucleic acid sequence of exemplary mammalian nucleic acid
molecules of the invention, SEQ ID NOs:1-47, 60-135, 137, and 138; and the protein sequence of exemplary
mammalian protein molecules of the invention, SEQ ID NOs:48-59 and 136.

MODES FOR CARRYING OUT THE INVENTION 

Definitions

"Sample" is used in its broadest sense. A sample containing nucleic acid molecules may 
comprise a bodily fluid; a cell; an extract from a cell, chromosome, organelle, or membrane isolated from 
a cell; genomic DNA, RNA, or cDNA in solution or bound to a substrate; a biological tissue or biopsy 
thereof; a fingerprint or tissue print; natural or synthetic fibres; in a solution; in a liquid suspension; in a 
gaseous suspension; in an aerosol; and the like.

"Plurality" refers to a group of one or more members, preferably to a group of at least
about 10, more preferably to a group of at least about 100 members, and even more preferably to a
group of 10,000 members.

"Substrate" refers to a rigid or semi-rigid support to which nucleic acid molecules or proteins are 
bound and includes membranes, filters, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels,
capillaries or other tubing, plates, polymers, and microparticles with a variety of surface forms including 
wells, trenches, pins, channels and pores. 

"Modulates" refers to a change in activity (biological, chemical, or immunological) or lifespan 
resulting from specific binding between a molecule and either a nucleic acid molecule or a protein.

"Microarray" refers to an ordered arrangement of hybridizable array elements on a substrate. The
array elements are arranged so that there are preferably at least ten different array elements, more
preferably at least 100 array elements, even more preferably at least 1000 array elements, and most
preferably 10,000. Furthermore, the hybridization signal from each of the array elements is individually
distinguishable. In a preferred embodiment, the array elements comprise nucleic acid molecules.

"Nucleic acid molecule" refers to a nucleic acid, oligonucleotide, nucleotide, polynucleotide or 
any fragment thereof. It may be DNA or RNA of genomic or synthetic origin, double-stranded or 
single-stranded, and combined with carbohydrate, lipids, protein, or other materials to perform a 
particular activity such as transformation or form a useful composition such as a peptide nucleic acid 
(PNA). "Oligonucleotide" is substantially equivalent to the terms amplimer, primer, oligomer, element,
target, and probe and is preferably single stranded. 

"Protein" refers to an amino acid sequence, oligopeptide, peptide, polypeptide, or portions 
thereof whether naturally occurring or synthetic. Exemplary portions are the first twenty consecutive 
amino acids of a mammalian protein encoded by SEQ ID NOs: 1-11, 17-33, 36, 39, and 41 and 
mammalian protein of SEQ ID NOs:50 and 53.

"Up-regulated" refers to a nucleic acid molecule whose levels increased in a treated sample 
compared with the nucleic acid molecule in an untreated sample.

"Down-regulated" refers to a nucleic acid molecule whose levels decreased in a treated sample
compared with the nucleic acid molecule in an untreated sample.

"Toxic compound" or "toxic agent" is any compound, molecule, or agent that elicits a
biochemical, metabolic, and physiological response in an individual or animal, such as i) DNA damage,
ii) cell damage, iii) organ damage or cell death, or iv) clinical morbidity or mortality.

"Toxicological response" refers to a biochemical, metabolic, and physiological response in an 
individual or animal which has been exposed to a toxic compound or agent. 

"Fragment" refers to an Incyte clone or any part of a molecule which retains a usable, functional
characteristic. Useful fragments include oligonucleotides and polynucleotides which may be used in
hybridization or amplification technologies or in regulation of replication, transcription or translation.
Exemplary fragments are the first sixty consecutive nucleotides of SEQ ID NOs:1-47. Useful fragments
also include polypeptides and protein molecules which have antigenic potential and which may be used
with a suitable pharmaceutical carrier in a pharmaceutical composition. Exemplary fragments are the
first twenty consecutive amino acids of a mammalian protein encoded by SEQ ID NOs:1-11, 17-33, 36,
39, and 41 and mammalian protein of SEQ ID NOs:50 and 53.

"Hybridization complex" refers to a complex between two nucleic acid molecules by virtue of the 
formation of hydrogen bonds between purines and pyrimidines.

"Ligand" refers to any compound, molecule, or agent which will bind specifically to a
complementary site on a nucleic acid molecule or protein. Such ligands stabilize or modulate the activity
of nucleic acid molecules or proteins of the invention and may be composed of at least one of the
following: inorganic and organic substances including nucleic acids, proteins, carbohydrates, fats, and
lipids.

"Percent identity" or "% identity" refers to the percentage of sequence similarity found in a 
comparison of two or more amino acid or nucleic acid sequences. Percent identity can be determined 
electronically, e.g., by using the MEGALIGN program (DNASTAR, Madison WI) which creates 
alignments between two or more sequences according to methods selected by the user, e.g., the clustal 
method. (See, e.g., Higgins, D.G. and P.M. Sharp (1988) Gene 73:237-244.) The clustal algorithm 
groups sequences into clusters by examining the distances between all pairs. The clusters are aligned 
pairwise and then in groups. The percentage similarity between two amino acid sequences, e.g., 
sequence A and sequence B, is calculated by dividing the length of sequence A, minus the number of gap 
residues in sequence A, minus the number of gap residues in sequence B, into the sum of the residue 
matches between sequence A and sequence B, times one hundred. Gaps of low or of no similarity 
between the two amino acid sequences are not included in determining percentage similarity. Percent 
identity between nucleic acid sequences can also be counted or calculated by other methods known in the 
art, e.g., the Jotun Hein method. (See, e.g., Hein, J. (1990) Methods Enzymol. 183:626-645.) Identity 
between sequences can also be determined by other methods known in the art, e.g., by varying 
hybridization conditions. 
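
The percentage similarity calculation described above can be written out as a short Python sketch. The function name `percent_similarity` and the example numbers are hypothetical; the arithmetic follows the sentence in the text, with the length of sequence A, the gap counts, and the number of residue matches supplied by the caller (for example, from an alignment program).

```python
def percent_similarity(length_a, gaps_a, gaps_b, matches):
    """Percentage similarity between sequence A and sequence B.

    Implements: matches / (length_a - gaps_a - gaps_b) * 100,
    where the inputs are counts taken from a pairwise alignment.
    """
    denominator = length_a - gaps_a - gaps_b
    if denominator <= 0:
        raise ValueError("length of A must exceed the total number of gap residues")
    return 100.0 * matches / denominator


# Illustrative (made-up) alignment counts.
similarity = percent_similarity(length_a=100, gaps_a=4, gaps_b=6, matches=72)
```

With these made-up counts the denominator is 90, so 72 matches correspond to 80 percent similarity.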

"Substantially purified" refers to nucleic acid molecules or proteins that are removed from their 
natural environment and are isolated or separated, and are at least about 60% free, preferably about 75% 
free, and most preferably about 90% free, from other components with which they are naturally 
associated. 

The Invention 

The present invention provides mammalian nucleic acid and protein molecules and methods of
using the nucleic acid molecules for screening test compounds and molecules for toxicological responses.
Additionally, the invention provides methods for characterizing the toxicological responses of a sample
to a test compound or molecule. In particular, the present invention provides a composition comprising a
plurality of nucleic acid molecules derived from human cDNA libraries, monkey cDNA libraries, mouse
cDNA libraries, normal rat liver cDNA libraries, normalized rat liver cDNA libraries, prehybridized rat
liver cDNA libraries, subtracted rat liver cDNA libraries, and rat kidney cDNA libraries. The nucleic
acid molecules have been further selected for exhibiting upregulated or downregulated gene expression
in rat livers when the rats have been exposed to a known hepatotoxin, including a peroxisomal
proliferator (PP), acetaminophen or one of its corresponding metabolites, a polycyclic aromatic
hydrocarbon (PAH), carbon tetrachloride, hydrazine, α-naphthylisothiocyanate, 4-acetylaminofluorene,
and their corresponding metabolites.

PPs include hypolipidemic drugs, such as clofibrate, fenofibrate, clofenic acid, nafenopin,
gemfibrozil, ciprofibrate, bezafibrate, halofenate, simfibrate, benzofibrate, etofibrate, WY-14,643, and
the like; n-alkylcarboxylic acids, such as trichloroacetic acid, valproic acid, hexanoic acid, and the like;
n-alkylcarboxylic acid precursors, such as trichloroethylene, tetrachloroethylene, and the like; azole
antifungal compounds, such as bifonazole, and the like; leukotriene D4 antagonists; herbicides;
pesticides; phthalate esters, such as di-[2-ethylhexyl] phthalate, mono-[2-ethylhexyl] phthalate, and the
like; and natural chemicals, such as phenyl acetate, dehydroepiandrosterone (DHEA), oleic acid,
methanol, and the like. In a preferred embodiment the toxin is clofibrate, or one of its corresponding
metabolites. In another preferred embodiment the toxin is fenofibrate, or one of its corresponding
metabolites.

PAHs include compounds such as benzo(a)pyrene, 3-methylcholanthrene, benz(a)anthracene,
7,12-dimethylbenz(a)anthracene, their corresponding metabolites, and the like. In a preferred
embodiment the toxin is benzo(a)pyrene, or one of its corresponding metabolites.

SEQ ID NOs:1-16 were identified by their pattern of at least two-fold upregulation or
downregulation following hybridization with sample nucleic acid molecules from rat liver tissue treated
with a known toxic compound. SEQ ID NOs:17-47 were identified by their homology to the sample
nucleic acid molecules from rat liver tissue treated with a known toxic compound. These and other
nucleic acid molecules can be immobilized on a substrate as hybridizable array elements in a microarray
format. The microarray may be used to characterize gene expression patterns associated with novel
compounds to elucidate any toxicological responses or to monitor the effects of treatments during clinical
trials or therapy where metabolic responses to toxic compounds may be expected.

When the nucleic acid molecules are employed as hybridizable array elements in a microarray,
the array elements are organized in an ordered fashion so that each element is present at a specified
location on the substrate. Because the array elements are at specified locations on the substrate, the
hybridization patterns and intensities (which together create a unique expression profile) can be
interpreted in terms of expression levels of particular genes and can be correlated with a toxicological
response associated with a test compound or molecule.
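
A minimal Python sketch of this idea follows: because each array element sits at a known position, signals read by position can be translated into a per-element expression profile. The `(row, column)` addressing scheme, the function name `expression_profile`, and all names and values are assumptions made for illustration only.

```python
def expression_profile(layout, intensities):
    """Translate per-position signals into a per-element expression profile.

    layout:      (row, column) -> array element name, fixed when the array is made
    intensities: (row, column) -> measured hybridization signal
    """
    missing = [pos for pos in intensities if pos not in layout]
    if missing:
        raise KeyError(f"no array element at positions {missing}")
    return {layout[pos]: signal for pos, signal in intensities.items()}


# Illustrative (made-up) layout and signals.
layout = {(0, 0): "elem_a", (0, 1): "elem_b", (1, 0): "elem_c"}
intensities = {(0, 0): 820.0, (0, 1): 45.0, (1, 0): 310.0}
profile = expression_profile(layout, intensities)
```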

The invention also provides a substantially purified and isolated mammalian protein comprising
the protein molecule of SEQ ID NOs:50 and 53 or portion thereof. The invention further provides
isolated and purified proteins encoded by the nucleic acid molecules of SEQ ID NOs:1-11, 17-33, 36, 39,
and 41, or portion thereof.

Furthermore, the present invention provides methods for screening test compounds or
therapeutics for potential toxicological responses and for screening a sample's toxicological response to a
particular test compound or molecule. Briefly, these methods entail treating a sample with the test
compound or molecule to elicit a change in gene expression patterns comprising the expression of a
plurality of sample nucleic acid molecules. Nucleic acid molecules are selected by identifying those
genes in rat liver or kidney that are upregulated or downregulated at least 2-fold, more preferably at least
2.5-fold, most preferably at least 3-fold, when treated with a known toxic compound or molecule. The
nucleic acid molecules are arrayed on a substrate. Then, the arrayed nucleic acid molecules and sample
nucleic acid molecules are combined under conditions effective to form hybridization complexes which
may be detected by methods well known in the art. Detection of higher or lower levels of such
hybridization complexes compared with hybridization complexes derived from untreated samples and
samples treated with a compound that is known not to induce a toxicological response correlates with a
toxicological response of a test compound or a toxicological response to a molecule.

Complementary DNA libraries

Molecules are identified that reflect all or most of the genes that are expressed in rat liver or
kidney. Molecules may be identified by isolating clones derived from several types of rat cDNA
libraries, including normal rat cDNA libraries, normalized rat cDNA libraries, prehybridized rat cDNA
libraries, and subtracted cDNA libraries. Clone inserts derived from these clones may be partially
sequenced to generate expressed sequence tags (ESTs). Molecules are also identified by comparing the
clones from rat cDNA libraries with clones from human, monkey, and mouse cDNA libraries using
computer software nucleic acid comparison programs such as BLAST (see, e.g., Altschul, S.F. (1993) J.
Mol. Evol. 36:290-300; Altschul et al. (1990) J. Mol. Biol. 215:403-410).

In one embodiment, two collections of ESTs are identified and sequenced. A first collection of
ESTs (the originator molecules) is derived from rat liver and kidney, specifically from the cDNA
libraries presented in the Examples. A second collection includes ESTs derived from other rat cDNA
libraries available in the ZOOSEQ database (Incyte Pharmaceuticals, Inc., Palo Alto CA).

The two collections of ESTs are clustered electronically to form master clusters of ESTs. Master
clusters are formed by identifying overlapping EST molecules and assembling these ESTs. A nucleic
acid fragment assembly tool, such as the Phrap tool (Phil Green, University of Washington) and the
GELVIEW fragment assembly system (GCG, Madison WI), can be used for this purpose. The minimum
number of clones which constitute a cluster is two. In another embodiment, a collection of human genes
known to be expressed in response to toxic agents is used to select representative ESTs from the 113 rat
cDNA libraries. The master cluster process is repeated for these molecules.
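
The grouping step can be sketched in Python as follows. This is not the Phrap or GELVIEW algorithm; it is a simple union-find grouping, offered as an illustration, that assumes overlaps between ESTs have already been detected and are supplied as pairs. The function name `master_clusters` and the EST identifiers are hypothetical. The minimum cluster size of two comes from the text.

```python
def master_clusters(est_ids, overlaps, min_size=2):
    """Group ESTs connected by pairwise overlaps into clusters.

    est_ids:  iterable of EST identifiers
    overlaps: iterable of (est_a, est_b) pairs judged to overlap
    min_size: smallest group reported as a cluster
    """
    est_ids = list(est_ids)
    parent = {est: est for est in est_ids}

    def find(est):
        # Follow parent links to the cluster root, compressing the path.
        while parent[est] != est:
            parent[est] = parent[parent[est]]
            est = parent[est]
        return est

    for a, b in overlaps:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = {}
    for est in est_ids:
        groups.setdefault(find(est), []).append(est)
    return sorted(sorted(g) for g in groups.values() if len(g) >= min_size)


# Illustrative (made-up) ESTs: est6 overlaps nothing and forms no cluster.
ests = ["est1", "est2", "est3", "est4", "est5", "est6"]
pairs = [("est1", "est2"), ("est2", "est3"), ("est4", "est5")]
clusters = master_clusters(ests, pairs)
```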
After assembling the clustered consensus nucleic acid sequences, a representative 5' clone is
nominated from each master cluster. The most 5' clone is preferred because it is most likely to contain
the complete gene. The nomination process is described in greater detail in "Relational Database and
System for Storing Information Relating to Biomolecular Sequences and Reagents", USSN 09/034,807,
filed March 4, 1998, herein incorporated in its entirety by reference. The EST molecules are used as
array elements on a microarray.

Selection of arrayed nucleic acid molecules

Samples are treated, preferably at subchronic doses, with one or more known toxic compounds
over a defined time course. Preferably, the agents are peroxisomal proliferators (PPs), acetaminophen or
one of its corresponding metabolites, polycyclic aromatic hydrocarbons (PAHs), carbon tetrachloride,
hydrazine, α-naphthylisothiocyanate, 4-acetylaminofluorene, or their corresponding metabolites.

The gene expression patterns derived from such treated biological samples can be compared with
the gene expression patterns derived from untreated biological samples to identify and select nucleic acid
molecules whose expression is either upregulated or downregulated due to the response to the toxic
compounds. These selected molecules may then be employed as array elements alone or in combination
with other array element molecules. Such a microarray is particularly useful to detect and characterize
gene expression patterns associated with known toxic compounds. Such gene expression patterns can
then be used for comparison to identify other compounds which also elicit a toxicological response.
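
By way of illustration only, the comparison of treated and untreated expression patterns can be
sketched in Python as follows. The two-fold threshold, the function name select_modulated, and
the gene names and signal values are assumptions made for the example and are not taken from this
specification.

```python
import math


def select_modulated(treated, untreated, min_fold=2.0):
    """Label genes as 'up' or 'down' by treated/untreated signal ratio.

    treated, untreated -- dicts mapping gene name to a positive signal value
    min_fold           -- assumed fold-change cutoff (hypothetical default)
    """
    threshold = math.log2(min_fold)
    selected = {}
    for gene, t in treated.items():
        u = untreated.get(gene)
        # Skip genes with a missing or non-positive signal in either sample.
        if u is None or t <= 0 or u <= 0:
            continue
        log_ratio = math.log2(t / u)
        if log_ratio >= threshold:
            selected[gene] = "up"
        elif log_ratio <= -threshold:
            selected[gene] = "down"
    return selected


treated = {"g1": 400.0, "g2": 100.0, "g3": 50.0}
untreated = {"g1": 100.0, "g2": 110.0, "g3": 200.0}
modulated = select_modulated(treated, untreated)
```

Genes labeled in this way would be candidates for use as array elements; an actual selection
would also take replicate measurements and the time course into account.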

The arrayed nucleic acid molecules can be manipulated to optimize their performance in
hybridization. To optimize hybridization, the arrayed nucleic acid molecules are examined using a
computer algorithm to identify portions of genes without potential secondary structure. Such computer
algorithms are well known in the art and are part of OLIGO 4.06 primer analysis software (National
Biosciences, Plymouth MN) or LASERGENE software (DNASTAR, Madison WI). These programs can
search within nucleic acid sequences to identify stem loop structures and tandem repeats and to analyze
G + C content of the sequence (those molecules with a G + C content greater than 60% are excluded).
Alternatively, the arrayed nucleic acid molecules can be optimized by trial and error. Experiments can be
performed to determine whether sample nucleic acid molecules and complementary arrayed nucleic acid
molecules hybridize optimally under experimental conditions.
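
By way of illustration only, the G + C content criterion can be sketched in Python as follows.
This is not the algorithm of the OLIGO or LASERGENE software; the function names gc_fraction and
passes_gc_filter and the example sequences are hypothetical.

```python
def gc_fraction(seq):
    """Return the fraction of G and C residues in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def passes_gc_filter(seq, max_gc=0.60):
    """Exclude molecules whose G + C content is greater than 60%."""
    return gc_fraction(seq) <= max_gc


candidates = ["ATGCATGCAT", "GGGCCCGCAT"]
retained = [s for s in candidates if passes_gc_filter(s)]
```

Screening for stem loop structures and tandem repeats would require additional analysis that is
not shown here.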

The arrayed nucleic acid molecules can be any RNA-like or DNA-like material, such as mRNAs,
cDNAs, genomic DNA, peptide nucleic acids, branched DNAs and the like. The arrayed nucleic acid
molecules can be in sense or antisense orientations.

In one embodiment, the arrayed nucleic acid molecules are cDNAs. The size of the DNA
sequence of interest may vary, and is preferably from 50 to 10,000 nucleotides, more preferably from 150
to 3,500 nucleotides. In a second embodiment, the nucleic acid molecules are vector DNAs. In this case
the size of the DNA sequence of interest, i.e., the insert sequence, may vary from about 50 to 10,000
nucleotides, more preferably from about 150 to 3,500 nucleotides.

The nucleic acid molecule sequences of the Sequence Listing have been prepared by current, 
state-of-the-art, automated methods and, as such, may contain occasional sequencing errors and 
unidentified nucleotides. Nucleotide analogues can be incorporated into the nucleic acid molecules by 
methods well known in the art. The only requirement is that the incorporated nucleotide analogues must 
serve to base pair with sample nucleic acid molecules. For example, certain guanine nucleotides can be 
substituted with hypoxanthine which base pairs with cytosine residues. However, these base pairs are 
less stable than those between guanine and cytosine. Alternatively, adenine nucleotides can be 
substituted with 2,6-diaminopurine which can form stronger base pairs than those between adenine and 
thymidine. Additionally, the nucleic acid molecules can include nucleotides that have been derivatized 
chemically or enzymatically. Typical modifications include derivatization with acyl, alkyl, aryl or amino 
groups. 

The nucleic acid molecules can be immobilized on a substrate via chemical bonding. 
Furthermore, the molecules do not have to be directly bound to the substrate, but rather can be bound to 
the substrate through a linker group. The linker groups are typically about 6 to 50 atoms long to provide 
exposure to the bound nucleic acid molecule. Preferred linker groups include ethylene glycol oligomers, 
diamines, diacids and the like. Reactive groups on the substrate surface react with one of the terminal 
portions of the linker to bind the linker to the substrate. The other terminal portion of the linker is then 
functionalized for binding the nucleic acid molecule. Preferred substrates are any suitable rigid or 
semirigid support, including membranes, filters, chips, slides, wafers, fibers, magnetic or nonmagnetic 
beads, gels, tubing, plates, polymers, microparticles and capillaries. The substrate can have a variety of 
surface forms, such as wells, trenches, pins, channels and pores, to which the arrayed nucleic acid 
molecules are bound. 

The samples can be any sample comprising sample nucleic acid molecules and obtained from any 
bodily fluid (blood, urine, saliva, phlegm, gastric juices, etc.), cultured cells, biopsies, or other tissue 
preparations. The samples can be derived from any species, but preferably from eukaryotic species, and 
more preferably from mammalian species such as rat and human. 

DNA or RNA can be isolated from the sample according to any of a number of methods well
known to those of skill in the art. For example, methods of purification of nucleic acids are described in
Tijssen, P. (1993) Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization With
Nucleic Acid Probes. Part I. Theory and Nucleic Acid Preparation, Elsevier, New York, NY. In one
preferred embodiment, total RNA is isolated using the TRIZOL total RNA isolation reagent (Life
Technologies, Inc., Gaithersburg MD) and mRNA is isolated using oligo d(T) column chromatography or
glass beads. When sample nucleic acid molecules are amplified it is desirable to amplify the sample
nucleic acid molecules and maintain the relative abundances of the original sample, including low
abundance transcripts. RNA can be amplified in vitro, in situ, or in vivo (See Eberwine US Patent No.
5,514,545).

It is also advantageous to include controls within the sample to assure that amplification and
labeling procedures do not change the true distribution of nucleic acid molecules in a sample. For this
purpose, a sample is spiked with an amount of a control nucleic acid molecule predetermined to be
detectable upon hybridization to its complementary arrayed nucleic acid molecule and the composition of
nucleic acid molecules includes reference nucleic acid molecules which specifically hybridize with the
control arrayed nucleic acid molecules. After hybridization and processing, the hybridization signals
obtained should reflect accurately the amounts of control arrayed nucleic acid molecules added to the
sample.

Prior to hybridization, it may be desirable to fragment the sample nucleic acid molecules.
Fragmentation improves hybridization by minimizing secondary structure and cross-hybridization to
other sample nucleic acid molecules in the sample or noncomplementary nucleic acid molecules.
Fragmentation can be performed by mechanical or chemical means.
Labeling 

The sample nucleic acid molecules may be labeled with one or more labeling moieties to allow
for detection of hybridized arrayed/sample nucleic acid molecule complexes. The labeling moieties can
include compositions that can be detected by spectroscopic, photochemical, biochemical, bioelectronic,
immunochemical, electrical, optical or chemical means. The labeling moieties include radioisotopes,
such as 32P, 33P or 35S, chemiluminescent compounds, labeled binding proteins, heavy metal atoms,
spectroscopic markers, such as fluorescent markers and dyes, magnetic labels, linked enzymes, mass
spectrometry tags, spin labels, electron transfer donors and acceptors, and the like. Preferred fluorescent
markers include Cy3 and Cy5 fluorophores (Amersham Pharmacia Biotech, Piscataway NJ).
Hybridization 

The nucleic acid molecule sequences of SEQ ID NOs: 1-47 and fragments thereof can be used in
various hybridization technologies for various purposes. Hybridization probes may be designed or
derived from SEQ ID NOs: 1-47. Such probes may be made from a highly specific region such as the 5'
regulatory region or from a conserved motif, and used in protocols to identify naturally occurring
sequences encoding the mammalian protein, allelic variants, or related sequences, and should preferably
have at least 50% sequence identity to any of the protein sequences. The hybridization probes of the
subject invention may be DNA or RNA and may be derived from the sequence of SEQ ID NOs: 1-47 or
from genomic sequences including promoters, enhancers, and introns of the mammalian gene.
Hybridization or PCR probes may be produced using oligolabeling, nick translation, end-labeling, or PCR
amplification in the presence of the labeled nucleotide. A vector containing the nucleic acid sequence
may be used to produce an mRNA probe in vitro by addition of an RNA polymerase and labeled nucleic
acid molecules. These procedures may be conducted using commercially available kits such as those
provided by Amersham Pharmacia Biotech.

The stringency of hybridization is determined by G+C content of the probe, salt concentration,
and temperature. In particular, stringency can be increased by reducing the concentration of salt or
raising the hybridization temperature. In solutions used for some membrane-based hybridizations,
addition of an organic solvent such as formamide allows the reaction to occur at a lower temperature.
Hybridization can be performed at low stringency with buffers, such as 5 x SSC with 1% sodium dodecyl
sulfate (SDS) at 60°C, which permits the formation of a hybridization complex between nucleotide
sequences that contain some mismatches. Subsequent washes are performed at higher stringency with
buffers such as 0.2 x SSC with 0.1% SDS at either 45°C (medium stringency) or 68°C (high stringency).
At high stringency, hybridization complexes will remain stable only where the nucleic acid sequences are
completely complementary. In some membrane-based hybridizations, preferably 35% or most preferably
50%, formamide can be added to the hybridization solution to reduce the temperature at which
hybridization is performed, and background signals can be reduced by the use of other detergents such as
Sarkosyl or Triton X-100 and a blocking agent such as salmon sperm DNA. Selection of components and
conditions for hybridization are well known to those skilled in the art and are reviewed in Ausubel
(supra) and Sambrook et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press,
Plainview NY.

Hybridization specificity can be evaluated by comparing the hybridization of specificity-control
nucleic acid molecules to specificity-control sample nucleic acid molecules that are added to a sample in
a known amount. The specificity-control arrayed nucleic acid molecules may have one or more sequence
mismatches compared with the corresponding arrayed nucleic acid molecules. In this manner, it can be
determined whether only complementary arrayed nucleic acid molecules are hybridizing to the sample
nucleic acid molecules or whether mismatched hybrid duplexes are forming.

Hybridization reactions can be performed in absolute or differential hybridization formats. In the
absolute hybridization format, nucleic acid molecules from one sample are hybridized to the molecules in
a microarray format and signals detected after hybridization complex formation correlate to nucleic acid
molecule levels in a sample. In the differential hybridization format, the differential expression of a set
of genes in two biological samples is analyzed. For differential hybridization, nucleic acid molecules
from both biological samples are prepared and labeled with different labeling moieties. A mixture of the
two labeled nucleic acid molecules is added to a microarray. The microarray is then examined under
conditions in which the emissions from the two different labels are individually detectable. Molecules in
the microarray that are hybridized to substantially equal numbers of nucleic acid molecules derived from
both biological samples give a distinct combined fluorescence (Shalon et al. PCT publication
WO95/35505). In a preferred embodiment, the labels are fluorescent markers with distinguishable
emission spectra, such as Cy3 and Cy5 fluorophores.

After hybridization, the microarray is washed to remove nonhybridized nucleic acid molecules
and complex formation between the hybridizable array elements and the nucleic acid molecules is
detected. Methods for detecting complex formation are well known to those skilled in the art. In a
preferred embodiment, the nucleic acid molecules are labeled with a fluorescent label and measurement
of levels and patterns of fluorescence indicative of complex formation is accomplished by fluorescence
microscopy, preferably confocal fluorescence microscopy.

In a differential hybridization experiment, nucleic acid molecules from two or more different
biological samples are labeled with two or more different fluorescent labels with different emission
wavelengths. Fluorescent signals are detected separately with different photomultipliers set to detect
specific wavelengths. The relative abundances/expression levels of the nucleic acid molecules in two or
more samples are obtained.

Typically, microarray fluorescence intensities can be normalized to take into account variations
in hybridization intensities when more than one microarray is used under similar test conditions. In a
preferred embodiment, individual arrayed-sample nucleic acid molecule complex hybridization intensities
are normalized using the intensities derived from internal normalization controls contained on each
microarray.
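
By way of illustration only, normalization against internal controls can be sketched in Python as
follows. Dividing each intensity by the mean of the control intensities on the same microarray is
one simple scheme assumed for the example; the function name normalize_array and the element
names and values are hypothetical.

```python
def normalize_array(intensities, control_ids):
    """Scale all intensities on one microarray by the mean control intensity.

    intensities -- dict mapping array element name to raw intensity
    control_ids -- names of the internal normalization controls on the array
    """
    controls = [intensities[c] for c in control_ids]
    if not controls:
        raise ValueError("at least one normalization control is required")
    mean_control = sum(controls) / len(controls)
    if mean_control <= 0:
        raise ValueError("control intensities must be positive")
    return {name: value / mean_control for name, value in intensities.items()}


# Two hypothetical arrays hybridized under similar conditions; array_b is
# uniformly twice as bright as array_a.
array_a = {"ctrl1": 100.0, "ctrl2": 300.0, "geneX": 400.0}
array_b = {"ctrl1": 200.0, "ctrl2": 600.0, "geneX": 800.0}
norm_a = normalize_array(array_a, ["ctrl1", "ctrl2"])
norm_b = normalize_array(array_b, ["ctrl1", "ctrl2"])
```

After normalization the two arrays report the same relative intensity for geneX, so that a
uniform difference in overall brightness between arrays is not mistaken for a difference in
expression.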

The labeled sample emits specific wavelengths which are detected using a plurality of
photomultipliers set to detect those wavelengths. The nucleic acid molecules whose relative
abundance/expression levels are modulated by treatment of a sample with a known toxic compound can
be used as hybridizable elements in a microarray. Such a microarray can be employed to identify
expression profiles associated with particular toxicological responses. The arrayed nucleic acid
molecules whose expression is modulated in response to a particular toxicological agent can then be
identified from their relative expression levels, and these relative expression levels can be employed to
identify other compounds with a similar toxicological response.

Alternatively, for some treatments with known side effects, the microarray, and expression 
patterns derived therefrom, is employed to prospectively define the treatment regimen. A dosage is 
established that minimizes expression patterns associated with undesirable side effects. This approach 
may be more sensitive and rapid than waiting for the patient to show toxicological side effects before
altering the course of treatment. 

Generally, the method for screening a library of test compounds or molecules to identify those
with a toxicological response entails selecting a plurality of arrayed genes whose expression levels are
modulated in tissues treated with known toxic compounds when compared with untreated tissues. Then a
sample is treated with the test compound or molecule to induce a pattern of gene expression comprising
the expression of a plurality of sample nucleic acid molecules. Tissues from a mammalian subject treated
at various dosages of the test compound may be screened to determine which doses may be toxic.
Then, the expression levels of the arrayed genes and the sample nucleic acid molecules are
compared to identify those compounds that induce expression levels of the sample nucleic acid molecules
that are similar to those of the arrayed genes. In one preferred embodiment, gene expression levels are
compared by contacting the arrayed genes with the sample nucleic acid molecules under conditions
effective to form hybridization complexes between arrayed genes and sample nucleic acid molecules; and
detecting the presence or absence of the hybridization complexes.

Similarity may mean that at least 1, preferably at least 5, more preferably at least 10, of the
upregulated arrayed genes form hybridization complexes with the sample nucleic acid molecules at least
once during a time course to a greater extent than would the nucleic acid molecules of a sample not
treated with the test compound. Similarity may also mean that at least 1, preferably at least 5, more
preferably at least 10, of the downregulated nucleic acid molecules form hybridization complexes with
the arrayed genes at least once during a time course to a lesser extent than would the nucleic acid
molecules of a sample not treated with the test compound.
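
By way of illustration only, the similarity criterion can be sketched in Python as follows. The
sketch assumes that hybridization signals have been measured at matching time points for treated
and untreated samples and treats any difference in signal as a greater or lesser extent of
hybridization; the function names responsive_genes and is_similar and the data are hypothetical.

```python
def responsive_genes(marker_genes, treated, untreated, direction):
    """Return marker genes whose treated signal differs from the untreated
    signal in the given direction at least once during the time course.

    treated, untreated -- dicts mapping gene name to a list of signals, one
                          per time point, in the same order for both samples
    direction          -- "up" (treated greater) or "down" (treated lesser)
    """
    if direction not in ("up", "down"):
        raise ValueError("direction must be 'up' or 'down'")
    hits = []
    for gene in marker_genes:
        pairs = list(zip(treated[gene], untreated[gene]))
        if direction == "up":
            hit = any(t > u for t, u in pairs)
        else:
            hit = any(t < u for t, u in pairs)
        if hit:
            hits.append(gene)
    return hits


def is_similar(up_markers, down_markers, treated, untreated, min_genes=1):
    """Apply the 'at least min_genes' criterion to either marker set."""
    up_hits = responsive_genes(up_markers, treated, untreated, "up")
    down_hits = responsive_genes(down_markers, treated, untreated, "down")
    return len(up_hits) >= min_genes or len(down_hits) >= min_genes


treated = {"a": [1, 5, 2], "b": [3, 3, 3], "c": [4, 1, 4]}
untreated = {"a": [2, 2, 2], "b": [3, 3, 3], "c": [4, 4, 4]}
similar = is_similar(["a", "b"], ["c"], treated, untreated, min_genes=1)
```

Setting min_genes to 5 or 10 corresponds to the preferred and more preferred thresholds. A
practical implementation would likely require the difference to exceed measurement noise rather
than accept any difference.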

Such a similarity of expression patterns means that a toxicological response is associated with the
compound or therapeutic tested. Preferably, the toxic compounds belong to the class of peroxisomal
proliferators (PPs), including hypolipidemic drugs, such as clofibrate, fenofibrate, clofenic acid,
nafenopin, gemfibrozil, ciprofibrate, bezafibrate, halofenate, simfibrate, benzofibrate, etofibrate,
WY-14,643, and the like; n-alkylcarboxylic acids, such as trichloroacetic acid, valproic acid, hexanoic acid,
and the like; n-alkylcarboxylic acid precursors, such as trichloroethylene, tetrachloroethylene, and the
like; azole antifungal compounds, such as bifonazole, and the like; leukotriene D4 antagonists;
herbicides; pesticides; phthalate esters, such as di-[2-ethylhexyl] phthalate, mono-[2-ethylhexyl]
phthalate, and the like; and natural chemicals, such as phenyl acetate, dehydroepiandrosterone (DHEA),
oleic acid, methanol, and the like. In another embodiment, the toxic compound is acetaminophen or one
of its corresponding metabolites. In yet another embodiment, the toxic compounds are polycyclic
aromatic hydrocarbons (PAHs), including compounds such as benzo(a)pyrene, 3-methylcholanthrene,
benz(a)anthracene, 7,12-dimethylbenz(a)anthracene, their corresponding metabolites, and the like. Of
particular interest is the study of the toxicological responses of these compounds on the liver, kidney,
brain, spleen, pancreas, and lung.

Modification of Gene Expression Using Nucleic Acids 

Gene expression may be modified by designing complementary or antisense molecules (DNA,
RNA, or PNA) to the control, 5', 3', or other regulatory regions of the mammalian gene.
Oligonucleotides designed with reference to the transcription initiation site are preferred. Similarly,
inhibition can be achieved using triple helix base-pairing which inhibits the binding of polymerases,
transcription factors, or regulatory molecules (Gee et al. In: Huber and Carr (1994) Molecular and
Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163-177). A complementary molecule
may also be designed to block translation by preventing binding between ribosomes and mRNA. In one
alternative, a library of nucleic acid molecules or fragments thereof may be screened to identify those
which specifically bind a regulatory, nontranslated sequence.

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme
molecule to complementary target RNA followed by endonucleolytic cleavage at sites such as GUA,
GUU, and GUC. Once such sites are identified, an oligonucleotide with the same sequence may be
evaluated for secondary structural features which would render the oligonucleotide inoperable. The
suitability of candidate targets may also be evaluated by testing their hybridization with complementary
oligonucleotides using ribonuclease protection assays.
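
By way of illustration only, the identification of candidate cleavage sites can be sketched in
Python as follows. The function name find_cleavage_sites and the example sequence are
hypothetical, and the conversion of T to U is an assumption made so that cDNA sequences can be
supplied directly.

```python
def find_cleavage_sites(rna, motifs=("GUA", "GUU", "GUC")):
    """Return (zero-based position, triplet) for each candidate cleavage site."""
    rna = rna.upper().replace("T", "U")  # assumption: accept cDNA input
    sites = []
    for i in range(len(rna) - 2):
        triplet = rna[i:i + 3]
        if triplet in motifs:
            sites.append((i, triplet))
    return sites


sites = find_cleavage_sites("AAGUCGGGUAUU")
```

Each reported site marks only the presence of a triplet; evaluation of secondary structure and
ribonuclease protection assays, as described above, would still be needed to assess a target.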

Complementary nucleic acids and ribozymes of the invention may be prepared via recombinant
expression, in vitro or in vivo, or using solid phase phosphoramidite chemical synthesis. In addition,
RNA molecules may be modified to increase intracellular stability and half-life by addition of flanking
sequences at the 5' and/or 3' ends of the molecule or by the use of phosphorothioate or 2' O-methyl rather
than phosphodiester linkages within the backbone of the molecule. Modification is inherent in the
production of PNAs and can be extended to other nucleic acid molecules. Either the inclusion of
nontraditional bases such as inosine, queuosine, and wybutosine, and/or the modification of adenine,
cytidine, guanine, thymine, and uridine with acetyl-, methyl-, or thio- groups renders the molecule less
available to endogenous endonucleases.
Screening Assays 

The nucleic acid molecule encoding the mammalian protein may be used to screen a library of
molecules for specific binding affinity. The libraries may be DNA molecules, RNA molecules, PNAs,
peptides, proteins such as transcription factors, enhancers, repressors, and other ligands which regulate
the activity, replication, transcription, or translation of the nucleic acid molecule in the biological system.
The assay involves combining the mammalian nucleic acid molecule or a fragment thereof with the
library of molecules under conditions allowing specific binding, and detecting specific binding to identify
at least one molecule which specifically binds the nucleic acid molecule.

Similarly, the mammalian protein or a portion thereof may be used to screen libraries of
molecules in any of a variety of screening assays. The portion of the protein employed in such screening
may be free in solution, affixed to an abiotic or biotic substrate (e.g. borne on a cell surface), or located
intracellularly. Specific binding between the protein and molecule may be measured. Depending on the
kind of library being screened, the assay may be used to identify DNA, RNA, or PNA molecules,
agonists, antagonists, antibodies, immunoglobulins, inhibitors, peptides, proteins, drugs, or any other
ligand, which specifically binds the protein. One method for high throughput screening using very small
assay volumes and very small amounts of test compound is described in USPN 5,876,946, incorporated
herein by reference, which screens large numbers of molecules for enzyme inhibition or receptor binding.
Purification of Ligand 

The nucleic acid molecule or a fragment thereof may be used to purify a ligand from a sample. A
method for using a mammalian nucleic acid molecule or a fragment thereof to purify a ligand would
involve combining the nucleic acid molecule or a fragment thereof with a sample under conditions to
allow specific binding, detecting specific binding, recovering the bound protein, and using an appropriate
agent to separate the nucleic acid molecule from the purified ligand.

Similarly, the protein or a portion thereof may be used to purify a ligand from a sample. A
method for using a mammalian protein or a portion thereof to purify a ligand would involve combining
the protein or a portion thereof with a sample under conditions to allow specific binding, detecting
specific binding between the protein and ligand, recovering the bound ligand, and using an appropriate
chaotropic agent to separate the protein from the purified ligand.

Pharmacology 

Pharmaceutical compositions are those substances wherein the active ingredients are contained in
an effective amount to achieve a desired and intended purpose. The determination of an effective dose is
well within the capability of those skilled in the art. For any compound, the therapeutically effective
dose may be estimated initially either in cell culture assays or in animal models. The animal model is
also used to achieve a desirable concentration range and route of administration. Such information may
then be used to determine useful doses and routes for administration in humans.

A therapeutically effective dose refers to that amount of protein or inhibitor which ameliorates
the symptoms or condition. Therapeutic efficacy and toxicity of such agents may be determined by
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., ED50 (the dose
therapeutically effective in 50% of the population) and LD50 (the dose lethal to 50% of the population).
The dose ratio between toxic and therapeutic effects is the therapeutic index, and it may be expressed as
the ratio, LD50/ED50. Pharmaceutical compositions which exhibit large therapeutic indexes are preferred.
The data obtained from cell culture assays and animal studies are used in formulating a range of dosage
for human use.
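
By way of illustration only, the therapeutic index can be computed in Python as follows. The
function name therapeutic_index and the dose values are hypothetical; both doses are assumed to
be expressed in the same units.

```python
def therapeutic_index(ld50, ed50):
    """Return the ratio of the dose lethal to 50% of the population (LD50)
    to the dose therapeutically effective in 50% of the population (ED50)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("doses must be positive")
    return ld50 / ed50


# Hypothetical doses in mg/kg body weight.
index = therapeutic_index(ld50=500.0, ed50=25.0)
```

A larger value indicates a wider margin between the effective dose and the lethal dose.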

MODEL SYSTEMS 

Animal models may be used as bioassays where they exhibit a toxic response similar to that of 
humans and where exposure conditions are relevant to human exposures. Mammals are the most 
common models, and most toxicity studies are performed on rodents such as rats or mice because of low 
cost, availability, and abundant reference toxicology. Inbred or outbred rodent strains provide a 
convenient model for investigation of the physiological consequences of under- or over-expression of 
genes of interest and for the development of methods for diagnosis and treatment of diseases. A mammal 
inbred to over-express a particular gene, so that the protein is secreted in milk, may also serve as a 
convenient source of the protein expressed by that gene. 
Toxicology 

Toxicology is the study of the effects of test compounds, molecules, or toxic agents on living 
systems to identify adverse effects. The majority of toxicity studies are performed on rats or mice to help 
predict whether adverse effects of agents will occur in humans. Observation of qualitative and 
quantitative changes in physiology, behavior, homeostatic, developmental, and reproductive processes, 
and lethality are used to generate profiles of safe or toxic responses and to assess the consequences on 
human health following exposure to the agent. 

Toxicological tests measure the effects of a single, repeated, or long-term exposure of a subject to 
a substance. Substances may be tested for specific endpoints such as cytotoxicity, mutagenicity, 
carcinogenicity and teratogenicity. Degree of response varies according to the route of exposure (contact, 
ingestion, injection, or inhalation), age, sex, genetic makeup, and health status of the subject. Other tests 
establish the toxicokinetic and toxicodynamic properties of substances. Toxicokinetic studies trace the 
absorption, distribution in subject tissues, metabolism, storage, and excretion of substances. 
Toxicodynamic studies chart biological responses that are consequences of the presence of the substance 
in the subject tissues. 

Genetic toxicology identifies and analyzes the ability of an agent to produce damage at a cellular 
or subcellular level. Such genotoxic agents usually have common chemical or physical properties that 
facilitate interaction with nucleic acids and are most harmful when mutated chromosomes are passed 
along to progeny. Toxicological studies may identify agents that increase the frequency of structural or 
functional abnormalities in progeny if administered to either parent before conception, to the mother 
during pregnancy, or to the developing organism. Mice and rats are most frequently used in these tests 
because of their short reproductive cycle which allows investigators to breed sufficient quantities of
individual animals to satisfy statistical requirements. 

All types of toxicology studies on experimental animals involve preparation of a suitable form of
the compound for administration, selection of the route of administration, and selection of a species
which resembles the species of pharmacological interest. Dose concentrations of the compound are
varied to identify, measure, and investigate a range of dose-related effects of exposure.

Acute toxicity tests are based on a single administration of the agent to the subject to determine
the symptomology or lethality of the agent. Three experiments are conducted: an experiment to define
the initial dose range; an experiment to narrow the range of effective doses; and a final experiment to
establish the dose-response curve.

Prolonged and subchronic toxicity tests are based on the repeated administration of the agent.
Rat and dog are commonly used in these studies to provide data from species in different taxonomic
orders. With the exception of carcinogenesis, there is considerable evidence that daily administration of
an agent at high-dose concentrations for periods of three to four months will reveal most forms of toxicity
in adult animals.

Chronic toxicity tests, with a duration of a year or more, are used to demonstrate either the
absence of toxicity or the carcinogenic potential of an agent. When studies are conducted on rats, at
least one test group plus one control group is used. Animals are quarantined, examined
for health, and monitored at the outset and at intervals throughout the experiment.
Transgenic Animal Models

Transgenic rodents which over-express or under-express a gene of interest may be inbred and
used to model human diseases or to test therapeutic or toxic agents. (See USPN 4,736,866; USPN
5,175,383; and USPN 5,767,337; incorporated herein by reference). In some cases, the introduced gene
may be activated at a specific time in a specific tissue type during fetal development or postnatally.
Expression of the transgene is monitored by analysis of phenotype or tissue-specific mRNA expression,
in transgenic animals before, during, and after being challenged with experimental drug therapies.
Embryonic Stem Cells 

Embryonic stem cells (ES) isolated from rodent embryos retain the potential to form an embryo.
When ES cells are placed inside a carrier embryo, they resume normal development and contribute to all
tissues of the live-born animal. ES cells are the preferred cells used in the creation of experimental
knockout and knockin rodent strains. Mouse ES cells, such as the mouse 129/SvJ cell line, are derived
from the early mouse embryo and are grown under culture conditions well known in the art. Vectors for
knockout strains contain a disease gene candidate modified to include a marker gene which disrupts
transcription and/or translation of the endogenous disease candidate gene in vivo. The vector is
introduced into ES cells by transformation methods such as electroporation, liposome delivery,
microinjection, and the like which are well known in the art. The endogenous rodent gene is replaced by
the disrupted disease gene through homologous recombination and integration during cell division.
Expression of the marker gene confers a selective advantage to the transformed cells when incubated with
an otherwise toxic/lethal selecting agent. Transformed ES cells are selected, identified, and preferably
microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. The blastocysts
are surgically transferred to pseudopregnant dams and the resulting chimeric progeny are genotyped and
bred to produce heterozygous or homozygous strains.

ES cells are also used to study the differentiation of various cell types and tissues in vitro , such as 
neural cells, hematopoietic lineages, and cardiomyocytes (Bain et al. (1995) Dev. Biol. 168:342-357; 
Wiles and Keller (1991) Development 111:259-267; and Klug et al. (1996) J. Clin. Invest. 98:216-224).
Recent developments demonstrate that ES cells derived from human blastocysts may also be manipulated 
in vitro to differentiate into eight separate cell lineages, including endoderm, mesoderm, and ectodermal 
cell types (Thomson et al. (1998) Science 282:1145-1147).
Knockout Analysis 

In gene knockout analysis, a region of a human disease gene candidate is enzymatically modified 
to include a non-mammalian gene such as the neomycin phosphotransferase gene (neo; Capecchi (1989) 
Science 244:1288-1292). The inserted coding sequence disrupts transcription and translation of the 
targeted gene and prevents biochemical synthesis of the disease candidate protein. The modified gene is 
transformed into cultured embryonic stem cells (described above), the transformed cells are injected into 
rodent blastulae, and the blastulae are implanted into pseudopregnant dams. Transgenic progeny are 
crossbred to obtain homozygous inbred lines. 
Knockin Analysis 

Totipotent ES cells, present in the early stages of embryonic development, can be used to create 
knockin humanized animals (pigs) or transgenic animal models (mice or rats) of human diseases. With 
knockin technology, a region of a human gene is injected into animal ES cells, and the human sequence 
integrates into the animal cell genome by recombination. Totipotent ES cells which contain the 
integrated human gene are handled as described above. Inbred animals are studied and treated to obtain 
information on the analogous human condition. These methods have been used to model several human 
diseases. (See, e.g., Lee et al. (1998) Proc. Natl. Acad. Sci. 95:11371-11376; Baudoin et al. (1998) Genes
Dev. 12:1202-1216; and Zhuang et al. (1998) Mol. Cell Biol. 18:3340-3349). 
Non-Human Primate Model 

The field of animal testing deals with data and methodology from basic sciences such as 
physiology, genetics, chemistry, pharmacology and statistics. These data are paramount in evaluating the 
effects of therapeutic agents on non-human primates as they can be related to human health. Monkeys
are used as human surrogates in vaccine and drug evaluations, and their responses are relevant to human 
exposures under similar conditions. Cynomolgus and Rhesus monkeys (Macaca fascicularis and Macaca 
mulatta, respectively) and Common Marmosets (Callithrix jacchus) are the most common non-human 
primates (NHPs) used in these investigations. Since great cost is associated with developing and 
maintaining a colony of NHPs, early research and toxicological studies are usually carried out in rodent 
models. In studies using behavioral measures such as drug addiction, NHPs are the first choice test 
animal. In addition, NHPs and individual humans exhibit differential sensitivities to many drugs and 
toxins and can be classified as a range of phenotypes from "extensive metabolizers" to "poor 
metabolizers" of these agents. 

In additional embodiments, the nucleic acid molecules which encode the mammalian protein may 
be used in any molecular biology techniques that have yet to be developed, provided the new techniques 
rely on properties of nucleic acid molecules that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Examples 

It is understood that this invention is not limited to the particular methodology, protocols, and 
reagents described, as these may vary. It is also understood that the terminology used herein is for the 
purpose of describing particular embodiments only, and is not intended to limit the scope of the present 
invention which will be limited only by the appended claims. The examples below are provided to best 
describe the subject invention and its representative constituents. 

I cDNA Library Construction 

The RALINOT01 cDNA library was constructed from liver tissue removed from a pool of fifty
10- to 11-week-old Sprague-Dawley female rats (Pharmacon, Waverly PA). The animals were housed in
standard laboratory caging and fed PMI-certified Rodent Diet #5002. The animals appeared to be in good
health at the time tissue was harvested. The animals were anesthetized by CO2 inhalation, and then
cardiocentesis was performed.

Frozen tissue was homogenized and lysed in TRIZOL reagent (1 g tissue/10 ml TRIZOL; Life
Technologies), a monophasic solution of phenol and guanidine isothiocyanate, using a POLYTRON
homogenizer (PT-3000; Brinkmann Instruments, Westbury NY). After a brief incubation on ice,
chloroform (1:5 v/v) was mixed with the reagent, and the mixture was centrifuged at 1,000 rpm. The
upper aqueous layer was removed to a fresh tube, and the RNA was precipitated with isopropanol,
resuspended in DEPC-treated water, and treated with DNase I for 25 min at 37°C. The RNA was
re-extracted once with phenol-chloroform, pH 4.7, and precipitated using 0.3 M sodium acetate and
2.5 volumes ethanol. The mRNA was then isolated using an OLIGOTEX kit (QIAGEN, Chatsworth CA) and
used to construct the cDNA library.

The mRNA was handled according to the recommended protocols in the SUPERSCRIPT plasmid 
system (Life Technologies). The cDNAs were fractionated on a SEPHAROSE CL-4B column 
(Amersham Pharmacia Biotech), and those cDNAs exceeding 400 bp were ligated into the pINCY1
plasmid vector (Incyte Pharmaceuticals). The plasmid pINCY1 was subsequently transformed into DH5α
or DH10B competent cells (Life Technologies).

The RAKINOT01 library was constructed using mRNA isolated from kidney tissue removed 
from a pool of fifty, 7- to 8-week-old male Sprague-Dawley rats, as described above. 

The RAKINOT02 library was constructed using mRNA isolated from kidney tissue removed
from a pool of fifty, 10- to 11-week-old female Sprague-Dawley rats, as described above.

II cDNA Library Normalization 

In some cases, cDNA libraries were normalized in a single round according to the procedure of
Soares et al. (1994, Proc. Natl. Acad. Sci. 91:9228-9232) with the following modifications. The primer
to template ratio in the primer extension reaction was increased from 2:1 to 10:1. Reduction of each
dNTP concentration in the reaction to 150 µM allowed the generation of longer (400-1000 nucleotide
(nt)) primer extension products. The reannealing hybridization was extended from 13 to 19 hours. The
single stranded DNA circles of the normalized library were purified by hydroxyapatite chromatography,
converted to partially double-stranded by random priming, and electroporated into DH10B competent
bacteria (Life Technologies).

The Soares normalization procedure is designed to reduce the initial variation in individual 
cDNA frequencies and to achieve abundances within one order of magnitude while maintaining the 
overall sequence complexity of the library. In the normalization process, the prevalence of high- 
abundance cDNA clones decreases significantly, clones with mid-level abundance are relatively 
unaffected, and clones for rare transcripts are increased in abundance. In the modified Soares 
normalization procedure, significantly longer hybridization times are used to increase gene discovery 
rates by biasing the normalized libraries toward low-abundance cDNAs that are well represented in a 
standard transcript image. 

The RALINON03, RALINON04, and RALINON07 normalized rat liver cDNA libraries were
constructed with 2.0 x 10^6, 4.6 x 10^5, and 2.0 x 10^6 independent clones from the RALINOT01 cDNA
library, respectively. The RALINOT01 cDNA library was normalized in one round using conditions
adapted from Soares (supra) except that a significantly longer (48-hour) reannealing hybridization was
used.

III cDNA Library Prehybridization 

The RALINOH01 cDNA library was constructed with clones from the RALINOT01 cDNA
library. After preparation of the RALINOT01 cDNA library, 9,984 clones were spotted onto a nylon
filter, lysed, and the plasmid DNA was bound to the filter. The filter was incubated with pre-warmed
hybridization buffer and then hybridized at 42°C for 14-16 hours in 0.75 M NaCl, 0.1 M
Na2HPO4/NaH2PO4, 0.15 M tris-HCl (pH 7.5), 5x Denhardt's Solution, 2% SDS, 100 µg/ml sheared
salmon sperm DNA, 50% formamide, and [32P]-labeled oligonucleotide molecules made from reverse
transcribed rat liver mRNA from an untreated animal. The filter was rinsed with 2 x SSC (saline sodium
citrate) at ambient temperature for 5 minutes followed by washing for 30 minutes at 68°C with
pre-warmed washing solution (2 x SSC, 1% SDS). The wash was repeated with fresh washing solution for an
additional 30 minutes at 68°C. Filters were then washed twice with pre-warmed washing solution (0.6 x
SSC, 1% SDS) for 30 minutes at 68°C. Some 4,224 clones had very low hybridization signals and about
20% of the clones had no signal; these two groups were isolated and sequenced.

IV Isolation and Sequencing of cDNA Clones 

DNA was isolated using the following protocol. Single bacterial colonies were transferred into
individual wells of 384-well plates (Genetix Ltd, Christchurch, United Kingdom) using sterile toothpicks.
The wells contained 1 ml of sterile Terrific Broth (Life Technologies) with 25 mg/l carbenicillin and
0.4% glycerol (v/v). The plates were covered and placed in an incubator (Thermodyne, Newtown Square
PA) at 37°C for 8-10 hours. Plasmid DNA was released from the cells and amplified using direct link
PCR (Rao, V.B. (1994) Anal. Biochem. 216:1-14) as follows. The direct link PCR solution included 30
ml of NUCLEIX PLUS PCR nucleotide mix (Amersham Pharmacia Biotech, Piscataway NJ) and 300 µl
of Taq DNA polymerase (Amersham Pharmacia Biotech). Five microlitres of the PCR solution were
added to each of the 384 wells using the MICROLAB 2200 system (Hamilton, Reno NV); plates were
centrifuged at 1000 rpm for 20 seconds and refrigerated until use. A 384 pin tool (V&P Scientific Inc,
San Diego CA) was used to transfer bacterial cells from the incubation plate into the plate containing the
PCR solution where 0.1% Tween 20 caused the cells to undergo lysis and release the plasmid DNA.
After lysis, the plates were centrifuged up to 500 rpm, covered with a cycle sealer, and cycled using a
384-well DNA ENGINE thermal cycler (MJ Research, Watertown MA) using the program dPCR30 with
the following parameters: Step 1) 95°C, 1 minute; Step 2) 94°C, 30 seconds; Step 3) 55°C, 30 seconds;
Step 4) 72°C, 2 minutes; Step 5) steps 2, 3, and 4 repeated 29 times; Step 6) 72°C, 10 minutes; and Step
7) storage at 4°C.
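The dPCR30 parameters listed above can be written down as data, which makes the cycle count and the programmed hold time easy to check. The following is a minimal sketch only; the class and function names are illustrative assumptions, and ramp times between temperatures, which depend on the instrument, are not modelled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One temperature hold of a thermal cycling program."""
    temp_c: float
    seconds: int


# Values taken from the dPCR30 description; names are hypothetical.
INITIAL_DENATURE = Step(95.0, 60)                           # Step 1
CYCLE = [Step(94.0, 30), Step(55.0, 30), Step(72.0, 120)]   # Steps 2-4
REPEATS = 29                                                # Step 5
FINAL_EXTENSION = Step(72.0, 600)                           # Step 6


def total_cycles(repeats: int = REPEATS) -> int:
    """Steps 2-4 run once and are then repeated `repeats` more times."""
    return 1 + repeats


def programmed_hold_seconds() -> int:
    """Sum of all programmed holds, excluding ramps and the 4 C storage step."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return (INITIAL_DENATURE.seconds
            + total_cycles() * per_cycle
            + FINAL_EXTENSION.seconds)


if __name__ == "__main__":
    print(total_cycles(), "cycles")
    print(programmed_hold_seconds() / 60, "minutes of programmed holds")
```

Under these assumptions the program runs thirty amplification cycles, and the programmed holds alone add up to 101 minutes.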






WO 01/36684 



PCT/US00/31743 



The concentration of DNA in each well was determined by dispensing 100 µl PICO GREEN
quantitation reagent (0.25% (v/v), Molecular Probes, Eugene OR) dissolved in 1x TE and 0.5 µl of
undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, Acton MA),
allowing the DNA to bind to the quantitation reagent. The plate was scanned in a Fluoroscan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantitate the
concentration of DNA. Typical concentrations of each DNA sample were in the range of 100 to 500
ng/ml.
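Fluorescence readings of this kind are commonly converted to concentrations through a standard curve. The text does not say how the conversion was done, so the following is only a sketch under stated assumptions: a linear response, a least-squares fit, and hypothetical standards whose values are invented for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of ys = slope * xs + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def concentration_from_signal(signal, slope, intercept, dilution_factor=1.0):
    """Invert the standard curve; dilution_factor is an assumed optional scaling."""
    return (signal - intercept) / slope * dilution_factor


if __name__ == "__main__":
    # Hypothetical standards (ng/ml) and fluorescence readings (arbitrary units).
    standards_ng_per_ml = [0.0, 100.0, 250.0, 500.0]
    fluorescence = [10.0, 210.0, 510.0, 1010.0]
    slope, intercept = fit_line(standards_ng_per_ml, fluorescence)
    print(concentration_from_signal(410.0, slope, intercept))
```

Whether the reported 100 to 500 ng/ml range refers to the diluted well or to the original PCR product is not stated, which is why the dilution factor is left as an explicit parameter.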

The cDNAs were prepared for sequencing using either a HYDRA microdispenser (Robbins
Scientific, Sunnyvale CA) or MICROLAB 2200 system (Hamilton) in combination with the DNA
ENGINE thermal cyclers (MJ Research). The cDNAs were sequenced using the method of Sanger, F.
and A.R. Coulson (J. Mol. Biol. (1975) 94:441-448) and the ABI 377 sequencing systems (PE
Biosystems). Most of the isolates were sequenced according to standard ABI protocols using ABI kits
(PE Biosystems). The solution volumes were used at 0.25x - 1.0x concentrations. Typically, 500 to 700
base pairs were sequenced in 3.5 to 4 hours. In the alternative, cDNAs may have been sequenced using
solutions and dyes from Amersham Pharmacia Biotech.

V Rat Liver and Kidney Gene Selection 

As a first step, originator molecules from high throughput sequencing experiments were derived
from clone inserts of RALINOT01, RAKINOT01, RAKINOT02, RALINOH01, RALINON03,
RALINON04, and RALINON07 cDNA library clones. There were 18,140 rat liver
molecules and 5,779 rat kidney molecules.

Additionally, 1,500 rat molecules derived from clone inserts of any of 113 rat cDNA libraries
were selected based on their homology to genes coding for polypeptides implicated in toxicological
responses including peroxisome-associated genes, lysosome-associated genes, apoptosis-associated
genes, cytochrome P450 genes, detoxification genes such as sulfotransferases, glutathione S-transferases,
and cysteine proteases, and the like.

Then, all the remaining molecules derived from all of the rat cDNA library clones were clustered
based on the originator molecules described above. The clustering process involved identifying
overlapping molecules that have a match quality indicated by a product score of 50 using BLAST.
A total of 6581 master clusters were identified.
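The grouping of overlapping molecules by match quality can be illustrated with a simple single-linkage clustering over scored pairs. This is a sketch, not the procedure actually used: it assumes that a product score of 50 acts as a minimum threshold, it does not seed clusters on originator molecules, and the clone identifiers and scores are hypothetical.

```python
def cluster(ids, scored_pairs, min_score=50):
    """Single-linkage clustering with a union-find structure.

    ids          -- iterable of molecule identifiers
    scored_pairs -- iterable of (id_a, id_b, product_score) tuples
    min_score    -- pairs scoring below this are ignored (assumed threshold)
    """
    ids = list(ids)
    parent = {i: i for i in ids}

    def find(x):
        # Walk to the root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, score in scored_pairs:
        if score >= min_score:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[root_b] = root_a

    groups = {}
    for i in ids:
        groups.setdefault(find(i), []).append(i)
    return sorted(sorted(group) for group in groups.values())


if __name__ == "__main__":
    clones = ["c1", "c2", "c3", "c4"]
    pairs = [("c1", "c2", 80), ("c2", "c3", 55), ("c3", "c4", 30)]
    print(cluster(clones, pairs))
```

In the example, c1, c2, and c3 are chained into one cluster by two qualifying overlaps, while c4 stays alone because its only match scores below the threshold.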

After forming the clone clusters, a consensus sequence was generated based on the assembly of
the clone molecules using PHRAP (Phil Green, University of Washington). The assembled molecules
were then annotated by first screening the assembled molecules against GenBank using BLASTn and
then by screening the assembled molecules against GenPept using FASTX. About two thirds of the
assembled molecules were annotated; about one third of the assembled molecules were not annotated.
For example, for nucleic acid sequence analysis, the program BLASTN 1.4.9MP-WashU was used with
default parameters; ctxfactor=2.00; E=10; MatID, 0; Matrix name, +5,-4. In another example, for amino
acid sequence analysis, the program NCBI-BLASTX 2.0.4 was used with default parameters; matrix,
BLOSUM62; gap penalties, existence 11, extension 1; frameshift window, decay constant 50, 0.1.

VI Substrate and Array Element/Probe Preparation 

Clones nominated in the process described in Example V were used to generate array elements.
Each array element was amplified from bacterial cells. PCR amplification used primers complementary
to the vector sequences flanking the cDNA insert. Array elements were amplified in thirty cycles of PCR
from an initial quantity of 1-2 ng to a final quantity greater than 5 µg. Amplified array elements were
then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech).
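As a rough check on the amplification figures above, the number of perfect doublings needed to go from 1-2 ng to 5 µg, and the mean per-cycle efficiency that thirty cycles would require, can be computed directly. This is a back-of-the-envelope sketch that assumes simple exponential growth; real reactions plateau and do not amplify with constant efficiency.

```python
import math


def doublings_needed(start_ng: float, final_ng: float) -> float:
    """Number of perfect doublings that turn start_ng into final_ng."""
    return math.log2(final_ng / start_ng)


def required_mean_efficiency(start_ng: float, final_ng: float, cycles: int) -> float:
    """Per-cycle efficiency e such that start * (1 + e) ** cycles == final."""
    return (final_ng / start_ng) ** (1.0 / cycles) - 1.0


if __name__ == "__main__":
    final_ng = 5000.0  # 5 micrograms expressed in nanograms
    for start_ng in (1.0, 2.0):
        print(start_ng, "ng start:",
              round(doublings_needed(start_ng, final_ng), 2), "doublings;",
              round(required_mean_efficiency(start_ng, final_ng, 30), 3),
              "mean efficiency over 30 cycles")
```

The stated yield corresponds to roughly eleven to twelve doublings, so thirty cycles leave ample headroom even when the average efficiency is well below one.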

Purified array elements were immobilized on polymer-coated glass slides. Glass microscope
slides (Corning, Corning NY) were cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled
water washes between and after treatments. Glass slides were etched in 4% hydrofluoric acid (VWR,
West Chester PA), washed extensively in distilled water, and coated with 0.05% aminopropyl silane
(Sigma-Aldrich, St. Louis MO) in 95% ethanol. Coated slides were cured in a 110°C oven.

Array elements were applied to the coated glass substrate using a procedure described in US
Patent No. 5,807,522 and incorporated herein by reference. In brief, 1 µl of the array element DNA, at an
average concentration of 0.5 µg/ml in 3 x SSC, was loaded into the open capillary printing element by a
high-speed robotic apparatus. The apparatus then deposited about 5 nl of the array element sample per
slide. A total of 7404 array elements representing rat liver and kidney genes and a variety of control
elements, including 14 synthetic control molecules, human genomic DNA, and yeast genomic DNA, were
arrayed in four identical quadrants within a 1.8 cm^2 area of the glass substrate.

Microarrays were UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene).
Microarrays were washed at room temperature once in 0.2% SDS and three times in distilled water.
Non-specific binding sites were blocked by incubation of microarrays in 0.2% casein in phosphate buffered
saline (PBS; Tropix Inc., Bedford MA) for 30 minutes at 60°C followed by washes in 0.2% SDS and
distilled water as before.

VII Target Preparation 

Male Sprague-Dawley rats (6-8 wk old) were dosed intraperitoneally with one of the following:
clofibrate (CLO; Acros, Geel, Belgium) at 250 mg/kg body weight (bw); acetaminophen (APAP; Acros)
at 1000 mg/kg bw; benzo(a)pyrene (B(a)P; Acros) at 10 mg/kg bw; or dimethylsulfoxide vehicle (DMSO;
Acros) at less than 2 ml/kg bw, and the animals were later euthanized by CO2 inhalation. Animals were
monitored daily for physical condition and body weight. Three animals per group were sacrificed
approximately 12 hours, 24 hours, 3 days (d), 7 d, 14 d, and 28 d following the single dose. Prior to
sacrifice a blood sample from each animal was drawn and assayed for serum alanine aminotransferase (ALT)
and serum aspartate aminotransferase (AST) levels using a diagnostic kit (Sigma-Aldrich). Observed gross
pathology and liver weights were recorded at time of necropsy. Liver, kidney, brain, spleen and pancreas
from each rat were harvested, flash frozen in liquid nitrogen, and stored at -80°C.

In the alternative, male Han-Wistar rats (8-9 wk old) were dosed by oral gavage with one of the
following: fenofibrate (FEN; Sigma-Aldrich) at 250 mg/kg bw; carbon tetrachloride (CCl4;
Sigma-Aldrich) at 3160 mg/kg bw; hydrazine (HYDR; Sigma-Aldrich) at 120 mg/kg bw;
α-naphthylisothiocyanate (ANIT; Sigma-Aldrich) at 200 mg/kg bw; 4-acetylaminofluorene (4-AAF;
Lancaster Synthesis, Morecambe, Lancashire, UK) at 1000 mg/kg bw; corn oil vehicle, or sterile water
vehicle, at 10 ml/kg bw. The animals were checked twice daily for clinical signs of distress. Blood was
collected six days prior to the dose and at sacrifice. Three animals per group were sacrificed
approximately six hours and 24 hours following the single dose. The animals were euthanized by
exsanguination under isoflurane anaesthesia. Observed gross pathology and liver weights were recorded
at time of necropsy. Livers from each rat were harvested, dissected into approximately 100 mg pieces,
flash frozen in liquid nitrogen, and stored at -70°C.

For each target preparation, frozen liver was homogenized and lysed in TRIZOL reagent (Life
Technologies, Gaithersburg MD) following the modifications for liver RNA isolation. Messenger RNA
was isolated using an OLIGOTEX kit (QIAGEN) and labeled with either Cy3- or Cy5-labeled primers
(Operon Technologies, Alameda CA) using the GEMBRIGHT labeling kit (Incyte Pharmaceuticals).
Messenger RNA isolated from tissues of rats treated with clofibrate, acetaminophen, or benzo(a)pyrene
was labeled with Cy5 and mRNA isolated from tissues of rats treated with DMSO was labeled with Cy3.
Quantitative and differential expression pattern control cDNAs were added to each labeling reaction.
Labeled cDNA was treated with 0.5 M sodium bicarbonate (pH 9.2) for 20 min at 85°C to degrade the
RNA and purified using two successive CHROMA SPIN 30 gel filtration spin columns (Clontech, Palo
Alto CA). Cy3-labeled control sample and Cy5-labeled experimental sample were combined and
precipitated in glycogen, sodium acetate, and ethanol.

Targets are also prepared from tissue needle biopsy samples. Samples are used to identify
changes within the tissue following exposure to, for example, a toxic compound, a potential toxic
compound, a compound with unknown metabolic responses, and a pharmacological compound.

VIII Hybridization

Hybridizations were carried out using the methods described by Shalon (supra). 

IX Detection 

The microscope used to detect the reporter-labeled hybridization complexes was equipped with
an Innova 70 mixed gas 10 W laser (Coherent Lasers, Santa Clara CA) capable of generating spectral
lines at 488 nm for excitation of Cy3, and 632 nm for excitation of Cy5. The excitation laser light was
focused on the array using a 20x microscope objective (Nikon, Melville NY). The slide containing the
array was placed on a computer-controlled X-Y stage on the microscope and raster-scanned past the
objective. The 1.8 cm x 1.8 cm array used in the present example was scanned with a resolution of 20
micrometers.

In two separate scans, a mixed gas multiline laser excited the two fluorophores sequentially.
Emitted light was split, based on wavelength, into two photomultiplier tube detectors (PMT R1477,
Hamamatsu Photonics, San Jose CA) corresponding to the two fluorophores. Appropriate filters
positioned between the array and the photomultiplier tubes were used to filter the signals. The emission
maxima of the fluorophores used were 565 nm for Cy3 and 650 nm for Cy5. Each array was typically
scanned twice, one scan per fluorophore using the appropriate filters at the laser source, although the
apparatus was capable of recording the spectra from both fluorophores simultaneously.

The sensitivity of the scans was typically calibrated using the signal intensity generated by a
cDNA control species added to the probe mix at a known concentration. A specific location on the array
contained a complementary DNA sequence, allowing the intensity of the signal at that location to be
correlated with a weight ratio of hybridizing species of 1:100,000. When two probes from different
sources (e.g., representing test and control cells), each labeled with a different fluorophore, were
hybridized to a single array for the purpose of identifying genes that are differentially expressed, the
calibration was done by labeling samples of the calibrating cDNA with the two fluorophores and adding
identical amounts of each to the hybridization mixture.

The output of the photomultiplier tube was digitized using a 12-bit RTI-835H analog-to-digital
(A/D) conversion board (Analog Devices, Norwood MA) installed in an IBM-compatible PC computer.
The digitized data were displayed as an image where the signal intensity was mapped using a linear
20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high signal). The data
were also analyzed quantitatively. Where two different fluorophores were excited and measured
simultaneously, the data were first corrected for optical crosstalk (due to overlapping emission spectra)
between the fluorophores using each fluorophore's emission spectrum.
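One common way to correct two-channel crosstalk is linear unmixing, in which each detector reading is modelled as the true signal plus a fixed fraction of the other fluorophore's signal. The text does not give the correction actually applied, so the following is only a sketch; the function name and the leak fractions are hypothetical and would in practice be derived from the emission spectra and filter sets.

```python
def unmix(measured_cy3, measured_cy5, cy3_into_cy5, cy5_into_cy3):
    """Recover true channel signals from crosstalk-contaminated readings.

    Assumed model (illustrative, not from the source):
        measured_cy3 = true_cy3 + cy5_into_cy3 * true_cy5
        measured_cy5 = cy3_into_cy5 * true_cy3 + true_cy5
    """
    det = 1.0 - cy3_into_cy5 * cy5_into_cy3
    if det == 0.0:
        raise ValueError("leak fractions make the system singular")
    true_cy3 = (measured_cy3 - cy5_into_cy3 * measured_cy5) / det
    true_cy5 = (measured_cy5 - cy3_into_cy5 * measured_cy3) / det
    return true_cy3, true_cy5


if __name__ == "__main__":
    # Hypothetical leak fractions: 5% of Cy3 reaches the Cy5 detector,
    # 1% of Cy5 reaches the Cy3 detector.
    print(unmix(1005.0, 550.0, cy3_into_cy5=0.05, cy5_into_cy3=0.01))
```

With the hypothetical fractions shown, readings of 1005 and 550 correspond to true signals of about 1000 (Cy3) and 500 (Cy5).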

A grid was superimposed over the fluorescence signal image such that the signal from each spot
was centered in each element of the grid. The fluorescence signal within each element was then
integrated to obtain a numerical value corresponding to the average intensity of the signal. The software
used for signal analysis was the GEMTOOLS gene expression analysis program (Incyte
Pharmaceuticals).
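The gridding step described above can be sketched as dividing the scanned image into equal square cells and averaging the pixel values in each cell. This is an illustration only and not the GEMTOOLS algorithm; it assumes a regular grid already aligned to the spots, and the tiny image is invented for the example.

```python
def spot_means(image, cell_px):
    """Average pixel intensity inside each cell_px x cell_px grid element.

    image   -- list of equal-length rows of pixel intensities
    cell_px -- edge length of one grid element in pixels (assumed uniform)
    """
    n_rows = len(image) // cell_px
    n_cols = len(image[0]) // cell_px
    means = []
    for r in range(n_rows):
        row_means = []
        for c in range(n_cols):
            total = 0
            for y in range(r * cell_px, (r + 1) * cell_px):
                for x in range(c * cell_px, (c + 1) * cell_px):
                    total += image[y][x]
            row_means.append(total / (cell_px * cell_px))
        means.append(row_means)
    return means


if __name__ == "__main__":
    # Hypothetical 4 x 4 pixel image holding a 2 x 2 grid of spots.
    image = [
        [1, 1, 5, 5],
        [1, 1, 5, 5],
        [0, 2, 4, 4],
        [2, 0, 4, 8],
    ]
    print(spot_means(image, 2))
```

A production implementation would also need to locate spot centres and subtract local background, neither of which is attempted here.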

X Results 

The expression patterns of eight cytochrome P450 isozymes known to be induced in a 
toxicological response were monitored during the 28 day time course. The results using clofibrate, 
acetaminophen, and benzo(a)pyrene are shown in Table 1, Table 2, and Table 3, respectively. Each of 
the known genes was upregulated or downregulated greater than 2-fold at least once during the time 
course. 
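The selection rule stated above, a change of greater than 2-fold at least once during the time course, can be expressed as a small filter over expression ratios. The sketch below uses the clofibrate values reported in Table 1; the function name and the `inclusive` switch (for the "at least 2-fold" wording used later in this section) are illustrative assumptions.

```python
# x-fold changes from Table 1 (12 hours, 24 hours, 3 days, 7 days, 28 days).
TABLE_1 = {
    "P450 LA-omega 4A3": [14.8, 26.6, 1.1, 0.5, 0.47],
    "P450 4A": [7.0, 16.6, 1.4, 0.5, 1.3],
    "P450 3A2": [0.14, 1.2, 0.63, 0.50, 0.45],
}


def changed(ratios, fold=2.0, inclusive=False):
    """True if any ratio is up- or down-regulated beyond `fold`.

    None entries stand for 'not detected' and are skipped.
    inclusive=True also accepts changes of exactly `fold`.
    """
    low = 1.0 / fold
    for ratio in ratios:
        if ratio is None:
            continue
        if inclusive and (ratio >= fold or ratio <= low):
            return True
        if not inclusive and (ratio > fold or ratio < low):
            return True
    return False


if __name__ == "__main__":
    for gene, ratios in TABLE_1.items():
        print(gene, changed(ratios))
```

All three Table 1 genes pass the strict filter, each through at least one time point well beyond the 2-fold boundary.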



TABLE 1 Gene expression patterns (x-fold change) of known genes in clofibrate-treated rat liver

Gene                 12 hours   24 hours   3 days   7 days   28 days
P450 LA-omega 4A3    14.8       26.6       1.1      0.5      0.47
P450 4A              7.0        16.6       1.4      0.5      1.3
P450 3A2             0.14       1.2        0.63     0.50     0.45



TABLE 2 Gene expression patterns (x-fold change) of known genes in acetaminophen-treated rat liver

Gene                 12 hours   24 hours   3 days   7 days   14 days   28 days
P450 4A              1.0        4.5        2.1      2.0      4.4       4.8
P450f2C7             0.21       0.43       0.47     0.5      1.2       1.3
P450 14DM            0.31       0.20       2.0      1.1      1.4       0.42


TABLE 3 Gene expression patterns (x-fold change) of known genes in benzo(a)pyrene-treated rat liver

Gene                      12 hours   24 hours   3 days   7 days   14 days   28 days
P450 LA-omega 4A3         1.2        2.3        2.4      1.4      6.8       1.2
P450 MCA-inducible 1A2    7.3        9.2        5.7      2.5      2.5       0.5



In addition, results from two samples that had been treated identically were compared to
determine the range of normal variation of gene expression patterns between the samples. In one
analysis, where two different samples were prepared from identically treated tissues, gene expression
patterns of cDNAs which were upregulated or downregulated not more than 1.7-fold were within the 95%
confidence limits of a Poisson normal distribution. In a separate analysis, gene expression patterns of
cDNAs which were upregulated or downregulated more than 2-fold accounted for not more than 5% of
the total hybridizable sample nucleic acid molecules in two identically-treated tissue samples.
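The replicate comparison above can be illustrated by computing, for two identically treated samples, the fraction of array elements whose signal ratio stays within a given fold limit. This sketch is not the statistical analysis that was performed (no confidence limits are fitted), and the signal values are hypothetical.

```python
def fraction_within(sample_a, sample_b, fold=1.7):
    """Fraction of paired elements whose a/b ratio lies within `fold` of 1.

    Pairs with a non-positive signal in either sample are skipped.
    """
    ratios = [a / b for a, b in zip(sample_a, sample_b) if a > 0 and b > 0]
    if not ratios:
        raise ValueError("no usable element pairs")
    inside = sum(1 for r in ratios if 1.0 / fold <= r <= fold)
    return inside / len(ratios)


if __name__ == "__main__":
    # Hypothetical signals for four elements in two replicate samples.
    replicate_1 = [100.0, 200.0, 300.0, 400.0]
    replicate_2 = [110.0, 150.0, 100.0, 390.0]
    print(fraction_within(replicate_1, replicate_2))
```

Applied to real replicate data, a fraction near 0.95 at a 1.7-fold limit would be consistent with the variation described in the text.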

We have discovered novel nucleotide molecules that are up-regulated or down-regulated at least 
2-fold at least once during the time course. These molecules are SEQ ID NOs:1-16 provided in the
Sequence Listing. These polynucleotide molecules can be used for screening compounds or therapeutics 
for a toxicologic effect and applications including detecting metabolic and toxicological responses, and in 
monitoring drug mechanism of action. 

Table 4 shows the gene expression pattern of selected molecules that were upregulated at least 2- 
fold at least once during the time course following treatment with clofibrate (CLO). Table 5 shows the 
gene expression pattern of selected molecules that were downregulated at least 2-fold at least once during 
the time course following treatment with CLO. 



TABLE 4 Gene expression patterns (x-fold change) of CLO-upregulated nucleic acid molecules

SEQ ID NO:   12 hours   24 hours   3 days   7 days   28 days
2            2.6        1.4        0.5      1.1      1.2
3            1.3        2          1.3      1.5      1.5
4            2          0.36       0.47     0.26     0.30
5            1.7        2.9        1.6      1.5      1.2
8            2.6        1.7        1.3      1.3      1.4



TABLE 5 Gene expression patterns (x-fold change) of CLO-downregulated nucleic acid molecules

SEQ ID NO:   12 hours   24 hours   3 days   7 days   28 days
1            n.d.       0.26       0.45     0.26     1.1
4            2.0        0.36       0.47     0.26     0.30
7            0.24       0.42       0.37     1.1      1.5

(n.d. = not detected)



Table 6 shows the gene expression pattern of selected molecules that were upregulated at least 2- 
fold at least once during the time course following treatment with acetaminophen (APAP). Table 7 
shows the gene expression pattern of selected molecules that were downregulated at least 2-fold at least 
once during the time course following treatment with APAP. 

TABLE 6 Gene expression patterns (x-fold change) of APAP-upregulated nucleic acid molecules

SEQ ID NO:   12 hours   24 hours   3 days   7 days   14 days   28 days
2            1.3        2.2        1.1      0.5      1.2       1.3
3            1.2        2.1        0.47     0.46     1.8       1.5
4            3.3        0.47       0.47     0.23     0.35      0.36
5            1.1        2.1        1.1      1.2      1.3       1.4
6            1.8        5          2.5      1.1      1.4       1.3
8            1.1        2.5        1.1      1        1.7       1.4



TABLE 7 Gene expression patterns (x-fold change) of APAP-downregulated nucleic acid molecules

SEQ ID NO:   12 hours   24 hours   3 days   7 days   14 days   28 days
1            0.36       0.19       0.46     0.25     0.5       1.4
4            3.3        0.48       0.47     0.23     0.35      0.36
7            0.33       0.21       1.7      n.d.     1         0.39

(n.d. = not detected)

Table 8 shows the gene expression pattern of selected molecules that were upregulated at least 2- 
fold at least once during the time course following treatment with benzo(a)pyrene (B(a)P). Table 9 
shows the gene expression pattern of selected molecules that were downregulated at least 2-fold at least 
once during the time course following treatment with B(a)P.



TABLE 8 Gene expression patterns (x-fold change) of B(a)P-upregulated nucleic acid molecules

SEQ ID NO:   12 hours   1 day   3 days   7 days   14 days   28 days
2            0.5        0.47    1.2      1.1      2.6       0.47
3            1.4        2.1     1.2      1.5      2.7       1.6
5            1.5        1.4     1.2      0.47     2         0.46
6            2.2        1.4     1.4      1.2      2.2       n.d.
7            1.2        2.2     1.4      0.5      0.42      1.1
8            1.6        1.7     1.3      1.3      2         1.1

(n.d. = not detected)

TABLE 9 Gene expression patterns (x-fold change) of B(a)P-downregulated nucleic acid molecules

SEQ ID NO:   12 hours   1 day   3 days   7 days   14 days   28 days
1            0.37       0.39    0.35     1.4      0.33      1.5
4            0.5        0.26    0.31     0.36     0.47      n.d.

(n.d. = not detected)

Table 10 shows the library abundance of selected molecules that were up- or down-regulated at
least once following treatment with various agents. Library abundance of each SEQ ID NO is presented
relative to the library in which that nucleic acid molecule (SEQ ID NO) was least abundant, which is
assigned a value of 1.
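The scaling used for Table 10 (least abundant library = 1) amounts to dividing each library's abundance by the smallest detected value for that molecule. A minimal sketch, assuming raw abundances are available as numbers and that undetected entries are represented as None; the raw values shown are hypothetical and are not taken from the libraries.

```python
def relative_abundance(raw):
    """Scale abundances so the least abundant detected library equals 1.

    raw -- dict mapping library name to a raw abundance, or None for n.d.
    """
    detected = [value for value in raw.values() if value is not None]
    if not detected:
        raise ValueError("molecule was not detected in any library")
    floor = min(detected)
    return {library: (None if value is None else value / floor)
            for library, value in raw.items()}


if __name__ == "__main__":
    # Hypothetical raw clone counts for one SEQ ID NO.
    raw_counts = {"Untreated": 8, "CLO": 14, "ANIT": 2, "HYDR": None}
    print(relative_abundance(raw_counts))
```

Undetected libraries are carried through unchanged so that they can still be reported as n.d. in the table.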



TABLE 10 Library abundance (least abundant = 1) patterns of nucleic acid molecules

SEQ ID NO:   Untreated   CLO   FEN   APAP   BaP    CCl4   HYDR   ANIT   4-AAF
8            4           7     6     3      9      4      1      1      3
9            13          5     6     4      15     5      6      6      2
10           n.d.        1     8     3      n.d.   n.d.   n.d.   1      n.d.
11           5           2     4     8      20     7      10     n.d.   2

(n.d. = not detected)

XI Identification and Analyses of Homologous Molecules in Other Organisms

The rat sequences (SEQ ID NOs:1-16) were used to identify additional sequences in the
ZOOSEQ and LIFESEQ databases (Incyte Pharmaceuticals) related to rat nucleic acid molecules 
regulated during toxicological response (SEQ ID NOs: 18-47). 

The first pass cDNAs, SEQ ID NOs:5, and 60 through 134, were assembled using PHRAP (Phil
Green, supra), using the following default parameters, to produce the contiguous sequence SEQ ID
NO:135. Mismatch penalty = -2; gap initiation penalty <0; gap extension penalty <0; minimum length of
matching word = 14; minimum SWAT score = 30; bandwidth = 14; use raw SW scores, "No"; index
word size = 10; maximum gap size = 30; number of initial bases to be converted to 'N', 0; vector segment
length = 60; Mismatch penalty for scoring degenerate end sequence = -2; Min. score for converting
degenerate end sequence to 'N', 20; Minimum size of confirming segment = 8; Amount by which
confirming segments are trimmed = 1; Penalty for confirming matches = -5; Min. SWAT score for
confirming matches = 30; LLR cutoff for displaying discrepancies = 20; Minimum segment size = 8;
Spacing between nodes = 4; Split/reassemble initial 'greedy' assembly, "No".
Translation of SEQ ID NO: 135 using MACDNASIS PRO software (version 1.0, Hitachi 
Software Engineering) using default parameters of the program elucidated the putative protein coding 
region, SEQ ID NO: 136. The nucleic acid and amino acid sequences were queried against databases such 
as the LIFESEQ (Incyte), GenBank, and SwissProt databases using BLAST. Motifs, HMM algorithms, 
and alignments with BLOCKS, PRINTS, Prosite, and PFAM databases were used to perform functional 
analyses; the antigenic index (Jameson-Wolf analysis) was determined using LASERGENE software 
(version t.62dl, DNASTAR). BLAST2 analysis of SEQ ID NOs:135 and 136 using the human EST 
LIFESEQ database (Incyte) identified Incyte Clone Numbers 746355H1 (SEQ ID NO: 137) and 
1294663H1 (SEQ ID NO: 138) which were assembled with their respective clustered clones to produce 
SEQ ID NOs:37 and 38 which encoded SEQ ID NOs:51 and 52, respectively. 
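
The translation step described above may be illustrated with a minimal sketch that translates a nucleotide sequence in the three forward reading frames and reports the longest stretch beginning with methionine. The sketch assumes the standard genetic code; the function names are hypothetical, and the procedure actually used by the MACDNASIS PRO software is not described in the text, so this is not a reproduction of it.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon; unknown codons (e.g. with 'n') give 'X'."""
    dna = dna.lower()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def longest_orf(dna):
    """Longest peptide starting at M in any forward frame.

    The peptide ends at the first stop codon or at the end of the sequence,
    so a partial (open-ended) coding region is also reported.
    """
    best = ""
    for frame in range(3):
        for segment in translate(dna[frame:]).split("*"):
            start = segment.find("M")
            if start != -1 and len(segment) - start > len(best):
                best = segment[start:]
    return best
```

For example, translate("atggcctaa") gives "MA*", the asterisk marking the stop codon.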

Functional analysis of SEQ ID NO: 136 using BLOCKS, PRINTS, Prosite, PFAM, Motifs, and 
HMM algorithms identified a potential protein kinase C phosphorylation site at residue S84 (Motifs); a 
potential signal peptide from residue M1 through residue A33 (SPScan); a potential transmembrane 
domain from residue P37 through residue L56 (HMM TM), a sodium/neurotransmitter symporter 
signature from residue G34 through A53, a sodium/alanine symporter signature from G34 through A53, 
and an asparaginase/glutaminase family signature from residue W64 through residue G75 (BLOCKS and 
PRINTS). 
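
A motif search of the kind that reports a potential protein kinase C phosphorylation site may be sketched as a simple pattern scan. The sketch assumes the PROSITE consensus pattern [ST]-x-[RK] for such sites; the pattern set actually used by the Motifs program is not stated in the text, and the function name is hypothetical.

```python
import re

# Assumed PROSITE-style pattern [ST]-x-[RK]; the lookahead allows overlapping sites.
PKC_SITE = re.compile(r"(?=([ST]).[RK])")


def find_pkc_sites(protein):
    """Return candidate sites labelled by residue and 1-based position, e.g. 'S84'."""
    return [f"{m.group(1)}{m.start() + 1}"
            for m in PKC_SITE.finditer(protein.upper())]
```

Such short patterns occur frequently by chance, so each hit is only a potential site, as the text indicates.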

Functional analysis of SEQ ID NO:51 using BLOCKS, PRINTS, Prosite, PFAM, Motifs, and 
HMM algorithms identified a potential protein kinase C phosphorylation site at residue S83 (Motifs); a 
potential signal peptide sequence from residue M1 through residue A52 (SPScan); a sodium/alanine 
symporter signature from residue G33 through residue A52, an asparaginase/glutaminase family signature 
from residue W63 through residue G74, and a channel-forming colicin domain from residue K31 through 
residue G49 (BLOCKS and PRINTS). Functional analysis of SEQ ID NO:52 using BLOCKS, PRINTS, 
Prosite, PFAM, Motifs, and HMM algorithms identified a potential signal peptide sequence from residue 
M1 through A53 (SPScan); a sodium/alanine symporter signature from residue G34 through residue A53, 
a 6-phosphogluconate dehydrogenase family signature from residue G15 through residue A40, an 
FAD-dependent glycerol-3-phosphate dehydrogenase family signature from residue Y18 through residue Y30, 
and a vacuolar ATP synthetase 16 kDa subunit signature from residue L39 through residue G65 
(BLOCKS and PRINTS). 
CLAIMS 

What is claimed is: 

1. A method for detecting or diagnosing the effect of a toxic compound or molecule associated with 
increased or decreased levels of nucleic acid molecules in a mammalian subject comprising: 

a) treating a mammalian subject with a toxic compound or molecule; 

b) obtaining a sample containing nucleic acids from the mammalian subject treated with the 
toxic compound or molecule; 

c) contacting the sample with a microarray comprising a plurality of nucleic acid molecules 
of SEQ ID NOs:1-47, or a fragment thereof under conditions for the formation of one or 
more hybridization complexes; and 

d) detecting the hybridization complexes, wherein the presence, absence or change in amount of 
the hybridization complex, as compared with the hybridization complexes formed from nucleic 
acid molecules from an untreated mammalian subject, is indicative of a metabolic response to the 
toxic compound or molecule. 

2. The method of claim 1 wherein: 

a) the sample is a tissue chosen from liver, kidney, brain, spleen, pancreas, and lung; 

b) the sample is liver tissue; 

c) the toxic compound or molecule which elicits the metabolic response induces at least a 
2-fold change in the amount of at least one of the nucleic acid molecules of the sample; 

d) the toxic compound is a peroxisome proliferator; 

e) the toxic compound is a hypolipidemic drug; and 

f) the toxic compound is clofibrate or one of its corresponding metabolites. 

3. The method of claim 1 wherein: 

a) the sample is a tissue chosen from liver, kidney, brain, spleen, pancreas, and lung; 

b) the sample is liver tissue; 

c) the toxic compound or molecule which elicits the metabolic response induces at least a 
2-fold change in the amount of at least one of the nucleic acid molecules of the sample; 

d) the toxic compound is acetaminophen or one of its corresponding metabolites. 

4. The method of claim 1 wherein: 

a) the sample is a tissue chosen from liver, kidney, brain, spleen, pancreas, and lung; 

b) the sample is liver tissue; 

c) the toxic compound or molecule which elicits the metabolic response induces at least a 
2-fold change in the amount of at least one of the nucleic acid molecules of the sample; 

d) the toxic compound is a polycyclic aromatic hydrocarbon; 
e) the toxic compound is a diol epoxide; and 

f) the toxic compound is benzo(a)pyrene, or one of its corresponding metabolites. 

5. A method for detecting or diagnosing a toxicological response to a test compound or molecule in 
a mammalian subject, the method comprising: 

a) treating a mammalian subject with a test compound or molecule; 

b) obtaining a sample containing nucleic acids from the mammalian subject treated with the test 
compound or molecule; 

c) contacting the sample with a microarray comprising a plurality of nucleic acid molecules 
of SEQ ID NOs: 1-47, or a fragment thereof, under conditions for the formation of one or more 
hybridization complexes; 

d) detecting the hybridization complexes, wherein the presence, absence or change in amount of 
the hybridization complex, as compared with the hybridization complexes formed from nucleic 
acid molecules from a normal or untreated mammalian subject, is indicative of a toxic response 
to the test compound or molecule. 

6. The method of claim 5 wherein the test compound which elicits the metabolic response is a 
compound with a previously known metabolic response. 

7. The method of claim 5 wherein the test compound which elicits the metabolic response is a 
compound with a previously unknown metabolic response. 

8. An isolated and purified nucleic acid molecule selected from SEQ ID NOs: 1-11, 17-33, 36, 39, 
and 41, or a fragment thereof, wherein said fragments are at least 60 contiguous nucleotides in length. 

9. A method of using a molecule selected from SEQ ID NOs: 1-59 or a fragment thereof to screen a 
library of molecules or compounds to identify at least one molecule or compound which specifically 
binds the selected molecule, the method comprising: 

a) combining the selected molecule with a library of molecules or compounds under conditions 
to allow specific binding; and 

b) detecting specific binding, thereby identifying a molecule or compound which specifically 
binds the selected molecule. 

10. The method of claim 9 wherein the library is selected from DNA molecules, RNA molecules, 
peptide nucleic acids, artificial chromosome constructions, peptides, proteins, and drugs. 

11. An isolated and purified antibody identified using the method of claim 9. 

12. An isolated and purified nucleic acid molecule variant having at least 70% nucleic acid sequence 
identity to the nucleic acid molecule of claim 8. 

13. An isolated and purified nucleic acid molecule having a sequence which is complementary to the 
nucleic acid molecule of claim 8. 
14. An isolated and purified agonist identified using the method of claim 9. 

15. An isolated and purified antagonist identified using the method of claim 9. 

16. An expression vector comprising at least a fragment of the nucleic acid molecule of claim 8. 

17. A host cell comprising the expression vector of claim 16. 

18. A method for producing a polypeptide, the method comprising the steps of: 

a) culturing the host cell of claim 17 under conditions suitable for the expression of the 
polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

19. An isolated and purified protein molecule encoded by the nucleic acid molecule selected from 
SEQ ID NOs:1-11, 17-33, 36, 39, and 41, an isolated and purified protein molecule of SEQ ID NOs:50 
and 53, or a portion thereof, wherein said portions encode at least 20 contiguous amino acids in length. 

20. A pharmaceutical composition comprising the protein molecule of claim 19 in conjunction with a 
suitable pharmaceutical carrier. 
SEQUENCE LISTING 



<110> INCYTE GENOMICS, INC. 
CUNNINGHAM, Mary Jane 
ZWEIGER, Gary 
KASER, Matthew R. 
PANZER, Scott 
SEILHAMMER, Jeffrey J. 
YUE, Henry 
BAUGHN, Mariah 
AZIMZAI, Yalda 
LAL, Preeti 

<120> MAMMALIAN TOXICOLOGICAL RESPONSE MARKERS 

<130> PC-0007 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 09/443,184 
<151> 1999-11-19 

<160> 138 

<170> PERL Program 

<210> 1 
<211> 285 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700305024H2 

<400> 1 

agagttctag cctcacttta agatgcttct ttctctcaga attaaaggac tcgttttact 60 
aagcgtantt ccaaagcatg ttacttacat tccttcttgc tatccacaga cctggtaatt 120 
aactctatca catggtttct actctctaat ggagaacagg agaaaaatga gtcccaagct 180 
tcccaatcag aattttaaat cttgactttt tttcccaaat catttaactg gagatgaaca 240 
gaccaaggca ggaaaaagaa aacaaggttc tagagatcat ttgac 285 

<210> 2 

<211> 291 

<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700306220H1 

<400> 2 

ggctgcggtg ccttcggtcg cgcgtacacg ttgcatctcc tagcttcctc ctgaaccccg 60 
ttttacgttc gcggcgggga aaacagcctg acgagtagac tgcagctcct gggagatggc 120 
ggcgctgtgc cttacggtga acgccggaaa ccctccactg gaagctctgc tggcagtgga 180 
gcatgtgaaa ggtgatgtca gcatttctgt ggaagaaggg aaggagaatc ttcttcgggt 240 
ttctgagagt gtggtgttca ctgacacaaa ttcaatcctg cgctacttgg c 291 

<210> 3 
<211> 293 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700510669H1 

<400> 3 

aggacctgtc cttacatatt gtggcctgaa gggacaaaat atgaggagtt naatannagg 60 
acaattccac tgtttatttt ccttggtgct aaattaaaga atcaagccct tgttcgagcc 120 
tttgaaattt tggcctactt tatttcagac actcaaaata caaatgccaa caaatggtnc 180 
tgatatattt gagagtggga aggaatctct gatgtttaaa tttcactgtt gatctttcaa 240 
aatggactag gcttaggatt acaatgaacc ttttgtcctt tgtcagtgtt tcg 293 



<210> 4 
<211> 260 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700525676H1 

<400> 4 

gcagctcgga ctagtcagag gnctctggcg agggtggcat cgggatgccg tccgaagtca 60 
cccacagtga cggangcccg ggtgcgaggg tctgcgcgca acgtcaggta cttagctccc 120 
tgtggtatac tgatgaacag aacccttgca ccgtgggcct cagttttgcc taaagagatc 180 
tgtgcaagaa ccttcttcag aatcactaca ccattagtaa ataagcgana ggagtattca 240 
gagaggagaa ttatagggta 260 



<210> 5 
<211> 290 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700535332H1 

<400> 5 

aagggccagt tgcatccgca cccagtgctt gtaccttgaa ctcatttctt cctgactgct 60 
agaggcctgt gtgttcttaa ctgctccgac ctctcctcca caggtgcagg cctggtgtgg 120 
tctccaaagt gactgaacaa tgcagaagga cagtggccca ctggttcctt tacattatna 180 
tggtttcggc tatgcggccc tggtggctac tggtgggatt attggctatg caaaagcagg 240 
tatgtgccgt ccctggctgc tggatcttct ttgggggcct ggcaggctgg 290 



<210> 6 
<211> 287 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700536004H1 

<400> 6 

attatgtaaa taatgagcaa gatcaaatta acaaagacta gttacccagc attccgcatc 60 
tagtcagttt tgtcatgggg cagttcaagc tgccacctga gaacatcact aggctctcag 120 
ggttcttggc accactcacc caagttacat ccaccagatt attttcagtc ttcacaagta 180 
tcaccatgca tagtgggatt ttcagccatg aataaagggc gtgcgttttg ccatatcagt 240 
ctctaaaata acctttgcta atcaatgcag tgagttgcta aggttta 287 
<210> 7 
<211> 264 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700640924H1 

<400> 7 

gtgatgaaat gaggtatctc aaatccactg acagataaga aaacagggtt agagggaaag 60 
tcacctctgt cacgtagagg cagaatatat gaacttaact ctagtttcca tgtctgtctt 120 
tattaccttc atctttctac ttcctggcca caggcatttc acttaattga gcctaatgtc 180 
agtatctgtg tgtgtcaatg tcgttaccac attctgatga agctaaaaaa taaaatttnn 240 
tttgggccaa aaaaaaaaaa aagg 264 



<210> 8 
<211> 238 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700775760H1 

<400> 8 

ganaccgaca ttttaatggt tcttangagg accaccacta gagtcaaggn ganaatggga 60 
tgacgcgtgt tgcngtcctg ctgattctga caagagctgn tcactatgac agacagatgg 120 
actgaatgga ctagaattat gtgaatctgt attatttaca gttggtancc aagagcatcg 180 
atactcttta gagaggcagg ttaaataaag gattaagtat ttaggatntg aaatttat 238 



<210> 9 
<211> 112 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700132084H1 

<400> 9 

ctatgcccaa ggaaaaggct ccagaacaca ttccccttct cttcattgcc ttcccatcaa 60 
gcaaggatcc aacctgggag gaccgattcc cagnnncggg ncannaagnn gg 112 



<210> 10 
<211> 238 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700176719H1 

<400> 10 

tcttggtccc ttcacctgac ctccggtgct ccaacgggcg gcagaatgga agaaggtgag 60 
gacccaggaa gtctgattaa agtgatccac ttgctggtct tgtctggtgt ctggggcatg 120 
cagatgtggg tgacctttgc ctcaggcttc ctgcttttcc ggagcctccc gaggcacacg 180 
tttggacttg tgcagagcaa gctcttccca gtctattttc acgtctcctt gggttgtg 238 



WO 01/36684 PCT/US00/31743 

<210> 11 
<211> 247 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701195696H1 

<400> 11 

ggatctttct gggcgagcaa cccgcaaaac gttgtgcatt gcgttgaaaa ggtgcatctg 60 
gttcccgatt ctactcccca cccgcgaccg cacacagcaa acatgaccca gcagccgcct 120 
gacgttgagg aggatgactg tctttctgaa taccaccacc tcttctgccc ggaccttctc 180 
caggacaaag tggcttttat cactggtggt ggttctggga ttggcttccg gatcgccgag 240 
attttca 247 



<210> 12 
<211> 256 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 
<223> 700483259H1 

<400> 12 

gtgacgtaca tggaaaacaa agcctacggg gacaggctca agccgcagac agcagcaagt 60 
aaagcgcctn cggccctgaa gcatggcagc tatcccttcc agcggctcgc tcgtggctac 120 
ccatgactac tatcggcgta agtagcccct cgccagcccc gcccagggct ggcccagggc 180 
tctgtggctg acccgcctcc ccttcccagg acgtctgggc tcctcgtcca gcaacagctc 240 
cggcggaagt gcagag 256 



<210> 13 
<211> 285 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700607235H1 

<400> 13 

ctgaagaccc accatgtctc tgctgactac tgtactactt ctctggggtt tcattctggg 60 
cccagcaact gacacagcct gtatattcaa ggaagcctcg gaaaacagtc ccttgcccag 120 
gccctggctt tctgccaatc cagtgccctg gatcacacct ggcctgagga cattcctgct 180 
gtgccagggg acagtgcggg atgtagtctt catgctgagg cgggaaggag atgatggttt 240 
cctggcgata gtccaacaga tgtttttctg gagggagctg gaccc 285 



<210> 14 
<211> 293 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700609074H1 

<400> 14 

ggcgtggagt tggagnagag cgtcaggcgc ctccgggaga agtttcatgg aaaagtgtcc 60 
cccaagaagg caggggctct tatgaggaag tttggcagcg accacactgg agttgggcgc 120 
tctatcgtgt acgggctcaa gcagaaagat ggacaggagc tgagcaacga tttggacgnc 180 
caggacccac cagaggacat gaagcaggac caagatatcc aggcagtagc cacctctctg 240 
ttgcccctga cgcaagccaa tcttcgaatg ttccaaagag cccaagatga cct 293 



<210> 15 
<211> 268 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700627890H1 

<400> 15 

gtacaangag ngccggggct tgggtctagt tggaggggan gcagtggcca gtncagggct 60 
cagatgagag agttagccga gttaggggca gctactagga tgggggcagg aggagaagcg 120 
gggctaacta taaagaagac tagatttcgn cacagtgggt atgtggaagg cagctttcaa 180 
accgcccttg tcaaacaaca cagggccagc agccttcaag accaggctat ccctgccgtc 240 
tgctggcatg ggggcacttg taccgtcc 268 



<210> 16 
<211> 265 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700629293H1 

<400> 16 

atgaccttta acttttctaa aaatgtgaag ttttgtactt atatatatca gctaaagtat 60 
tntagcattc tttagtgtac ttagtttgat gccactttta gtgtttttgt tgcttttgtc 120 
tgatttttat gaatgttcat tttaagactc cttgttgaaa tgggacagtt tcgttctttg 180 
ataagcccga gaagaggatt cccttgggtg ttgacctcct ctgcatgatg tgcccaagca 240 
tctgaactgc aaccaaggcc tttnc 265 



<210> 17 

<211> 267 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701322438H1 

<400> 17 

acctgccctt acatattgtg gcctgaagng acaaaatatg agaagttcaa tgaaaagata 60 
attccccctt tcaggaaaga tgttctctta ttttacttgg cgctaaatca aagaatcaag 120 
cctttgttca agcctttgca attttggcct attttatttc agagagcaaa tggatggtat 180 
atatttggga gtgggaaggn tctttgattt ttaaatttca ctgntgagct ttcaaataga 240 
ctaggcctta ggattacaat gaacaac 267 



<210> 18 

<211> 239 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701082352H2 

<400> 18 

atttcttagt ggggcaagga cctgccctta catattgtgg cctgaaggga caaaatatga 60 
gaagttcaat gaaaagataa ttcccccttt caggaaagat gttctcttat tttacttggn 120 
gctaaatcaa agaatccagc cctttgttca agcctttgca attttggcct attttatttc 180 
agagagcaaa tggttgttat atatttggga gtgggaagga atcttgattt ttaaatttc 239 



<210> 19 
<211> 244 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701423834H1 

<400> 19 

gtctcctgag tgcttaaatt acaggtgtgt accactaaac caaccctaag aatccatttt 60 

aaaatgtcag tcactttaga tttcttagtg gggcaaggac ctgcccttac atattgtggc 120 

ctgaagggac aaaatatgag aagttcaang aaaagataat tccccctttc aggaaagatg 180 

ttctcttatt ntacttggtg ctaaatcaaa gaatcaagcc tttgttcaag cctttgcaat 240 



<210> 20 

<211> 240 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701423842H1 

<400> 20 

gtctcctgag tgcttaaatt acaggtgtgt accactaaac caaccctaag aatccatttt 60 
aaaatgtcag tcactttana tttcttagng gggcaaggac ctgcccttac atattggggc 120 
ctgaagggac aaaatatgag aagttcaatg nanagntnan tccccctttc aggaaagatg 180 
gtctcttatt ttacttggng ctaaatcaaa gaatcaagcc tttgntcaag cctttgcaat 240 



<210> 21 
<211> 224 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701090430H1 

<400> 21 

ggcagctcgg accagtcaga gggccctggc gagggtggca tcggggtgcc atccgaagtc 60 
gaccaccgtg acggaagccc cggcgcgggg gtctgcgcgc gacgtcagac acttagctgc 120 
ctgtggtgta ctgataaaca gaacccttcc accgtgtgct gcagttttgc ctaaagagat 180 
ctgtgcgaga actttcttca gantctctgc gccactagta aata 224 



<210> 22 
<211> 249 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700966369H1 

<400> 22 
gcttttatgt ancccaatca gagcancgac cagnaaaatt gcaagtnttg agaggcacac 60 
agcagaagan ctgcagantt ctgcttgatt ggcatctatc gttcctcctg agcagcaacc 120 
cacagtagat aggaaaaagg tgtttgacag gcctggctaa gctcttgcgg agccactggc 180 
atcagatggc gaagggactt gctgccaggt tgctgtctgt tggacagaag ctcngatgag 240 
gtgtgctgg 249 



<210> 23 
<211> 260 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700828522H1 

<400> 23 

caggcctggt gtggtctcca aagcgactga acaatgcaga aagacagtgg cccattgatg 60 
cctttacatt attttggttt cggctatgca gccctggttg ctaccggtgg gattattggc 120 
tatgccaaag caggtagtgt gccgtccctg gctgctggac tcttcttcgg gggcctggca 180 
ggcctggggg cctaccagct gtctcaggat cccaggaatg tgtgggtttt cctagctaca 240 
tctgggacct tggccggaat 260 



<210> 24 
<211> 246 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701250723H1 

<400> 24 

ctcggcttct cgctgtctgc tcgcgccctc gtcctacagc acaggcctcc cggctccggc 60 
tccggctcca gtgttggttg ggtgcaggcc tggtgtggtc tccaaagcga ctgaacaatg 120 
cagaaagaca gtggcccatt gatgccttta cattattttg gtttcggcta tgcagccctg 180 
gttgctaccg gtgggattat tggctatgcc aaagcaggta gtgtgccgtc cctggctgct 240 
ggactc 246 



<210> 25 
<211> 252 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701254093H1 

<400> 25 

acctcggctt ctcgctgtct gctcgcgccc tcgtcctaca gcacaggcct cccggctccg 60 
gcttccggct ccagtgttgg ttgggatgcc tttacattat tttggtttcg gctatgcagc 120 
cctggttgct accggtggga ttattggcta tgccaaagca ggtagtgtgc cgtccctggc 180 
tgctggactc ttcttcgggg gcctggcagg cctgggggct accagctgtc tcaggatccc 240 
aggaatgtgt gg 252 

<210> 26 
<211> 237 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701423901H1 

<400> 26 

attttggttt cggctatgca gccctggttg ctaccggtgg gattattggc tatgccaaag 60 
caggtagtgt gccgccctgg ctgctggact cttcttcggg ggcctggcag gcctggggcc 120 
taccagctgt ctcaggatcc caggaatgtg tgggttttcc tagctacatc tgggaccttg 180 
ccggaattat ggggatgaga ttctacaact cggggaaatt tatnctgcag gntaatc 237 



<210> 27 

<211> 274 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701251161H1 

<400> 27 

ggtgtttcgt gggttatctt tgcaaatggg ctccgcggcc tagcgccctg gtggcctaaa 60 

aacgaagcct gcaaggaagg ggttctccgc cgagcgcctc ggtcctgaag catggcagcc 120 

atcccttcca gcggctcgct cgtggctacc catgactact atcggcgtaa gtagcccctc 180 

gccagccccg cccagggctg gcccagggcc ctgtggctga cccgcctccc cttcccagga 240 

cgcctgggct cctcgtccag cagcagctcc ggcg 274 



<210> 28 

<211> 141 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701085115H2 

<400> 28 

aaagtgtccc ccaanaaggc aggggctctt atgaggaagt ttggcagcna ccacaccgga 60 

gttgggngct ctatcgtgta tggtgtcaag cagnaagacg gacangagct gatgcaacga 120 

cctggacgct caggacccac c 141 



<210> 29 

<211> 274 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701387375H1 

<400> 29 

ggagggctcg ctcttggggc tagtggtggg gaggcagtgg ccagttcagg gctcagatga 60 
gagaggtggc agaattagag gcagccacta ggatgggggt gcnaggagaa gcggggctaa 120 
gtataaagga nactagattt tgggacagtg gacgtgtgga aggcagcttc caaagcgcct 180 
ttaacaatcc acaaagaacc agnngctttc aagaccaggc tatccctgct gnctgctgna 240 
cttggacgtn caggangcac angtttcaca ggcg 274 



<210> 30 

<211> 257 

<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701389479H1 

<400> 30 

agggctcgct cttggggcta gtggtgggga ggcagtggcc agntcagggc tnagatgaga 60 
gangtggcag aattagaggc agccactagg atgggggtgc gaggagaagc ggggctaagt 120 
ataaaggaga ctanattttg ggacagtgga cgtgtggaag gcagnttnca aagcgccttt 180 
aacaatccac anagaaccag cagctttcaa gaccangcta tccctgctgc tgctgcactt 240 
gacgtcagga ngnacaa 257 



<210> 31 

<211> 246 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701389530H1 

<400> 31 

caaggagggc tcgctcttgg ggctagtggt ggngaggcag nggccagttc agggctcaga 60 
tganagaggc ggcanaatta gaggcagcca ctaggatggg ggtgccgagg agaagcgggg 120 
ctaagtataa aggagactag attttgggac agtggacgtg tggaaggcag cttccaaagc 180 
gcctttaaca atccacaaag aaccagcagc tttcaagacc angctatccc tgctgctgct 240 
gcactt 246 



<210> 32 

<211> 258 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701388372H1 

<400> 32 

gagggctcgc tcttggnggc taagnggtgg ggagtcagtg gccacgttca gggctcanat 60 
gagagaggtg gcagaattag aggcagccac taggatgggg gngccaggag aagcnggcta 120 
agtataaagg agactagatt ttgggacagt ggacgtgngg aaggcagctt ccaaagcgcc 180 
tttaacaatc cacanagaac cagnagcttt caaagaccag gctatccctc tgctgctggc 240 
acttgacgtc cagaaggc 258 



<210> 33 

<211> 257 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701270715H1 

<400> 33 

gttttctcat gaattgtttt tgcattgttg ataaagctag tatacccttt ggccttagcc 60 
tataaatttt aaatatataa acaaaatatt aaagatgtag ttaattttaa atgaccttta 120 
acttttctaa aaatgtgaag ttttgtactt acatatcatc taaagtatta tagcattttt 180 
aagtgtactt agtttgatgc cacttttagt gttttgttgc ttttgtctga tttttgtgaa 240 
tgttcatnta agactcc 257 



<210> 34 
<211> 4850 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 2302721 

<400> 34 

cgcacacgtt gcatcttctt cctttcgcgg ggtcctccgt agttctggca cgagccaggc 60 
gtactgacag gtggaccagc ggactggtgg agatggcgac gctctctctg accgtgaatt 120 
caggagaccc tccgctagga gctttgctgg cagtagaaca cgtgaaagac gatgtcagca 180 
tttccgttga agaagggaaa gagaatattc ttcatgtttc tgaaaatgtg atattcacag 240 
atgtgaattc tatacttcgc tacttggcta gagttgcaac tacagctggg ttatatggct 300 
ctaatctgat ggaacatact gagattgatc actggttgga gttcagtgct acaaaattat 360 
cttcatgtga ttcctttact tctacaatta atgaactcaa tcattgcctg tctctgagaa 420 
catacttagt tggaaactcc ttgagtttag cagatttatg tgtttgggcc accctaaaag 480 
gaaatgctgc ctggcaagaa cagttgaaac agaagaaagc tccagttcat gtaaaacgtt 540 
ggtttggctt tcttgaagcc cagcaggcct tccagtcagt aggtaccaag tgggatgttt 600 
caacaaccaa agctcgagtg gcacctgaga aaaagcaaga tgttgggaaa tttgttgagc 660 
ttccaggtgc ggagatggga aaggttaccg tcagatttcc tccagaggcc agtggttact 720 
tacacattgg gcatgcaaaa gctgctcttc tgaaccagca ctaccaggtt aactttaaag 780 
ggaaactgat catgagattt gatgacacaa atcctgaaaa agaaaaggaa gattttgaga 840 
aggttatctt ggaagatgtt gcaatgttgc atatcaaacc agatcaattt acttatactt 900 
cggatcattt tgaaactata atgaagtatg cagagaagct aattcaagaa gggaaggctt 960 
atgtggatga tactcctgct gaacagatga aagcagaacg tgagcagagg atagaatcta 1020 
aacatagaaa aaaccctatt gagaagaatc tacaaatgtg ggaagaaatg aaaaaaggga 1080 
gccagtttgg tcagtcctgt tgtttgcgag caaaaattga catgagtagt aacaatggat 1140 
gcatgagaga tccaaccctt tatcgctgca aaattcaacc acatccaaga actggaaata 1200 
aatacaatgt ttatccaaca tatgattttg cctgccccat agttgacagc atcgaaggtg 1260 
ttacacatgc cctgagaaca acagaatacc atgacagaga tgagcagttt tactggatta 1320 
ttgaagcttt aggcataaga aaaccatata tttgggaata tagtcggcta aatctcaaca 1380 
acacagtgct atccaaaaga aaactcacat ggtttgtcaa tgaaggacta gtagatggat 1440 
gggatgaccc aagatttcct acggttcgtg gtgtactgag aagagggatg acagttgaag 1500 
gactgaaaca gtttattgct gctcagggct cctcacgttc agtcgtgaac atggagtggg 1560 
acaaaatctg ggcgtttaac aaaaaggtta ttgacccagt ggctccacga tatgttgcat 1620 
tactgaagaa agaagtgatc ccagtgaatg tacctgaagc tcaggaggag atgaaagaag 1680 
tagccaaaca cccaaagaat cctgaggttg gcttgaagcc tgtgtggtat agtcccaaag 1740 
ttttcattga aggtgctgat gcagagactt tttcggaggg tgagatggtt acatttataa 1800 
attggggcaa cctcaacatt acaaaaatac acaaaaatgc agatggaaaa atcatatctc 1860 
ttgatgcaaa gttgaatttg gaaaacaaag actacaagaa aaccactaag gtcacttggc 1920 
ttgcagagac tacacatgct cttcctattc cagtaatctg tgtcacttat gagcacttga 1980 
tcacaaagcc agtgctagga aaagacgagg actttaagca gtatgtcaac aagaacagta 2040 
agcatgaaga gctaatgcta ggggatccct gccttaagga tttgaaaaaa ggagatatta 2100 
tacaactcca gagaagagga ttcttcatat gtgatcaacc ttatgaacct gttagcccat 2160 
atagttgcaa ggaagccccg tgtgttttga tatacattcc tgatgggcac acaaaggaaa 2220 
tgccaacatc agggtcaaag gaaaagacca aagtagaagc cacaaaaaat gagacctctg 2280 
ctccttttaa ggaaagacca acaccttctc tgaataataa ttgtactaca tctgaggatt 2340 
ccttggtcct ttacaataga gtggctgttc aaggagatgt ggttcgtgaa ttaaaagcca 2400 
agaaagcacc aaaggaagat gtagatgcag ctgtaaaaca gcttttgtct ttgaaagctg 2460 
aatataagga gaaaactggc caggaatata aacctggaaa ccctcctgct gaaataggac 2520 
agaatatttc ttctaattcc tcagcaagta ttctggaaag taaatctctg tatgatgaag 2580 
ttgctgcaca aggggaggtg gttcgtaagc taaaagctga aaaatcccct aaggctaaaa 2640 
taaatgaagc tgtagaatgc ttactgtccc tgaaggctca gtataaagaa aaaactggga 2700 
aggagtacat acctggtcag cccccattat ctcaaagttc ggattcaagc ccaaccagaa 2760 
attctgaacc tgctggttta gaaacaccag aagcgaaagt actttttgac aaagtagctt 2820 
ctcaagggga agtagttcgg aaacttaaaa ctgaaaaagc ccctaaggat caagtagata 2880 
tagctgttca agaactcctt cagctaaagg cacagtacaa gtctttgata ggagtagagt 2940 
ataagcctgt gtcggccact ggagctgagg acaaagataa gaagaagaaa gaaaaagaaa 3000 
ataaatctga aaagcagaat aagcctcaga aacaaaatga tggccaaagg aaagaccctt 3060 
ctaaaaacca aggaggtggg ctctcatcaa gtggagcagg agaagggcag gggcctaaga 3120 
aacagaccag gttgggtctt gaggcaaaaa aagaagaaaa tcttgctgat tggtattctc 3180 
aggtcatcac aaagtcagaa atgattgaat accatgacat aagtggctgt tatattcttc 3240 
gtccctgggc ctatgccatt tgggaagcca tcaaggactt ttttgatgct gagatcaaga 3300 
aacttggtgt tgaaaactgc tacttcccca tgtttgtgtc tcaaagtgca ttagagaaag 3360 
agaagactca tgttgctgac tttgccccag aggttgcttg ggttacaaga tctggcaaaa 3420 
ccgagctggc agaaccaatt gccattcgtc ctactagtga aacagtaatg tatcctgcat 3480 
atgcaaaatg ggtacagtca cacagagacc tgcccatcaa gctcaatcag tggtgcaatg 3540 
tggtgcgttg ggaattcaag catcctcagc ctttcctacg tactcgtgaa tttctttggc 3600 
aggaagggca cagtgctttt gctaccatgg aagaggcagc ggaagaggtc ttgcagatac 3660 
ttgacttata tgctcaggta tatgaagaac tcctggcaat tcctgttgtt aaaggaagaa 3720 
agacggaaaa ggaaaaattt gcaggaggag actatacaac tacaatagaa gcatttatat 3780 
ctgctagtgg aagagctatc cagggaggaa catcacatca tttagggcag aatttttcca 3840 
aaatgtttga aatcgttttt gaagatccaa agataccagg agagaagcaa tttgcctatc 3900 
aaaactcctg gggtctgaca actcgaacta ttggtgttat gaccatggtt catggggaca 3960 
acatgggttt agtattacca ccccgtgtag catgtgttca ggtggtgatt attccttgtg 4020 
gcattaccaa tgcactttct gaagaagaca aagaagcgct gattgcaaaa tgcaatgatt 4080 
atcgaaggcg attactcagt gttaacatcc gcgttagagc tgatttacga gataattatt 4140 
ctccaggttg gaaattcaat cactgggagc tcaagggagt tcccattaga cttgaagttg 4200 
ggccacgtga tatgaagagc tgtcagtttg tagccgtcag acgagatact ggagaaaagc 4260 
tgacagttgc tgaaaatgag gcagagacta aacttcaagc tattttggaa gacatccagg 4320 
tcaccctttt cacaagggct tctgaagacc ttaagactca tatggttgtg gctaatacaa 4380 
tggaagactt tcagaagata ctagattctg gaaagattgt tcagattcca ttctgtgggg 4440 
aaattgactg tgaggactgg atcaaaaaga ccactgccag ggatcaagat cttgaacctg 4500 
gtgctccatc catgggagct aaaagccttt gcatcccctt caaaccactc tgtgaactgc 4560 
agcctggagc caaatgtgtc tgtggcaaga accctgccaa gtactacacc ttatttggtc 4620 
gcagctactg agggatgaac gaaagccccc tcttcaactc ctctcacttt ttaaagcatt 4680 
gatattagta tcttctcaga tacagaccat tttatgattt tttaaaaagt aaaagttcta 4740 
aaatgaagtc acacaggaca attattctta tgcctaagtt aacagtggat aaaagacttt 4800 
tctgtaaaca actccagtaa taaatatcat gaactaaaaa aaaaaaaaaa 4850 



<210> 35 

<211> 1762 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 2742442 

<400> 35 

attgcgcgag cgcacgggaa aagcgattgg tcggtcagga gagagaggtg tgtcctggcg 60 
ggcccgcagc tccgattggc cgacaggctg acgggaacgt ttacggtcag cgtgtgtcag 120 
cgacgtgcaa ccgggaaggg aagaaggggc gtgtcaggct gcgcaggcgg ccagtccatt 180 
ggctggaaga gaccggagcc gggctccggg cccgaccaga ggagggcggt gctgcagggc 240 
tggtccggga ggtgacgacc ggcttcggag agtctatcat ggcagctcgg actggtcata 300 
cggccttgag aagggtagtc tcgggatgcc gtccgaagtc ggcgacagcg gccggggcgc 360 
aggcgcccgt gcggaatggc agatatttag cttcctgtgg tatactgatg agcagaactc 420 
ttccactaca tacctcaatt ttgcctaagg agatatgtgc acgaactttc ttcaaaatca 480 
ctgcaccatt aataaacaaa aggaaagaat attcagagag aagaatttta ggatattcaa 540 
tgcaggaaat gtatgatgta gtatcgggag tggaggatta caagcatttt gttccttggt 600 
gcaaaaaatc agatgttata tcaaagagat ctggatattg taaaacaaga ttagaaattg 660 
gatttccacc tgtgttggag cgatatacat cagtagtaac cttggtgaaa cctcatttag 720 
taaaggcatc ttgtactgat gggagacttt tcaatcattt ggagactatt tggtgtttta 780 
gcccaggtct tcctggctac ccaagaactt gtaccttgga tttttcaatt tcttttgaat 840 
ttcgatcact tctacattcc cagcttgcca cactcttttt tgatgaagtt gtgaagcaga 900 
tggtagctgc ctttgaaaga agagcatgta agctgtatgg tccagaaaca aatatacctc 960 
gggagttaat gcttcatgaa gtccatcaca cataaaggca aaaaagaact ggtgccacct 1020 
gcttctgact ttagtttgtt cacttttagg aagtattttc atgacatgtt ttcagaagcc 1080 
agaaagcatt tgttaaacgc agctttggtt ataaacctgc accattgaaa atttgcacat 1140 
agaatataga ctcacttgta catagaatta tttcttcaag tataattcaa aataatatgg 1200 
acattatcat gttctgcatt acaataatgg gatgtcatca ccattgctag aatactggca 1260 
tgattcttct gagcagaagt tgaaactgta aatttaaacc ttttaattat caccttacct 1320 
gaaagaggtt agttaagata ttcacacagt atgtattata ttaaccatat cacacttaag 1380 
ttattaaatt cagactattt gtaacttatt gttatagggc ctgccgtatg gcttaggata 1440 
tttgagtaat catatattta aagtaaaaac tttgggctgg gcacagtggc tcacacctgt 1500 
aatcccagca cttggggaag ctgaggtggg cagatcagtt gaggtcagga gttctagacc 1560 
agcctggtca acatggcgaa accccatctc tactaaaaat acaaaaatta gctgggcgtg 1620 
gtggcacaca cctgtaatcc cagttacttg ggaggctgag gcacaagaat cgcttgaacc 1680 
cgggaggcgg aggttgcagt tagccaagat cgccctgctg cactccagcc tgggcaacag 1740 
agggagactc tgtctccaaa aa 1762 
<210> 36 
<211> 2862 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 3511087

<400> 36 

ctaagctcag aattcggctc gagtgctttt atttgctggt gttgaaagta gttcagccaa 60 
acccatgaca gcttcatgaa ttttaatcac atcttttttt cttccgcagc cgtcagcttt 120 
agtcagagga ccccttcaga cagccagtgt ctctcctagc atgccctttt cggcatcgct 180 
gttaggaacc ttacccattg gtgcgaggta tgctcctcca ccctccttct cagaatttta 240 
tccacctttg acttcatcct tagaagattt ttgttcttct ttaaattcat tttcaatgag 300 
tgaatccaaa cgagatctgt ccacctcaac ttctagagag ggaacaccgc ttaacaacag 360 
taattcttcc cttttactta tgaatggacc aggtagtttg tttgcttcag agaatttcct 420 
gggaatttca agtcagccta gaaatgactt tggaaacttt tttggaagtg cagttaccaa 480 
accatcttca tcagtgactc caagacatcc cctcgaagga acccatgaat tgagacaagc 540 
ttgccagatc tgttttgtaa aatcaggccc taagttaatg gatttcactt accatgctaa 600 
catagatcat aagtgtaaga aagatatttt aatcggtagg ataaagaatg ttgaagataa 660 
atcatggaaa aaaatacgtc caagaccaac aaaaacaaat tatgaaggac catattatat 720 
atgtaaagat gttgctgctg aggaggaatg tagatattca ggccactgca cgtttgctta 780 
ttgccaagag gagatagatg tgtggacact ggagcggaaa ggagcattca gccgggaggc 840 
tttctttggc ggcaatggaa agattaacct tactgtgttc aaacttctcc aggagcatct 900 
tggggaattt atattccttt gtgagaaatg ttttgatcat aagcctagaa tgataagtaa 960 
aagaaataaa gataattcta ctgcttgttc tcacccggtt acaaagcatg agtttgaaga 1020 
caataagtgc cttgtccaca ttttgcgaga gacaacagta aaatactcca aaatacgttc 1080 
ttttcatggt cagtgtcagc ttgatttatg tcgacatgaa gttcggtatg gctgtttaag 1140 
ggaagatgag tgcttttatg cccatagtct tgtggaactg aaagtctgga taatgcaaaa 1200 
tgaaacaggt atctcacatg atgctattgc tcaagagtct aaacgatatt ggcagaattt 1260 
ggaagcaaat gtacctggag cgcaggtact tggtaatcaa ataatgcctg gatttcttaa 1320 
tatgaagata aagtttgtgt gcgcccagtg tctgagaaac ggtcaagtca ttgaaccaga 1380 
caaaaacaga aaatattgta gtgcaaaagc aaggcattcg tggaccaaag accggcgtgc 1440 
gatgagagtg atgtctattg aacgtaagaa gtggatgaac atccgtcctc tccccacaaa 1500 
gaaacaaatg cctttacagt ttgatctgtg caaccatatt gcttctggga aaaaatgtca 1560 
atatgttgga aactgttcct ttgctcatag tcctgaggaa agagaagttt ggacttacat 1620 
gaaggagaat gggatacaag atatggagca attttacgaa ctatggctca agagtcaaaa 1680 
aaatgaaaaa agtgaagaca tagccagtca gtcaaacaag gaaaatggaa aacaaattca 1740 
catgccaaca gattatgctg aagttacagt ggactttcac tgctggatgt gtgggaaaaa 1800 
ctgcaacagt gagaagcagt ggcagggcca catctcctcc gagaagcaca aagagaaggt 1860 
tttccacacc gaggacgacc agtactgctg gcagcaccgc ttcccaacag gctatttcag 1920 
tatttgtgat aggtatatga atggcacctg cccagaagga aacagctgta aatttgcaca 1980 
tggaaatgcc gaacttcatg aatgggaaga aagaagagat gccctaaaga tgaagctcaa 2040
caaagcacga aaagatcact taattggccc aaatgataat gactttggaa aatatagttt 2100 
tttgtttaaa gatttaaact aatatgctgg cttttatgta tgatacctaa tcagagcatt 2160 
gaccagaaaa attgaaagtg ttctgaggca catagcagag gagctgcaga tttcctgctt 2220 
gtattggcgt atatcgttcc tcctgagcag caacccacag taggtaggaa aatgggctgt 2280 
ttcacaggcc tggccacgct ctcacggaac cactggcatc agatggtgaa gtgactgcta 2340 
cccggttgcc atctgttgaa cagacttttg gatgaagtgt gttggggaag aggataaggt 2400 
tatatctagg acaactcttt gagttggtcc ttcatataag aatcgtgacg gtaagagaat 2460 
aaacacttgt actgggatca gaatacatga tggatgaaat tctttacatg ttttagcaga 2520 
atgaatttgt ttaatataat aaagtttgct acttatctgt atgtaggttg ctaaaaagga 2580 
ttttcttaac tcagatttta agccaaataa ccatttaaca ctagtatttg ttaaatgggg 2640 
tatttttctg tatttgtatg tttcactata ataagggaat taaggataat gtgcattgag 2700 
aatattttga aaaataattg actcaaattt tatttcttgg tcttttgctg tttaaatgat 2760 
gattttgaaa gattaaacct gtactgttgg tattgtgtta gtgtatggac caatactgcc 2820 
tgtaataaag attttatata tagatgcaaa aaaaaaaaaa aa 2862 



<210> 37 
<211> 1263 
<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature

<223> Incyte ID No.: 1968009 

<400> 37 

ccgtccccat tctctgaccg cccctctccc 
cccctctcct cgccccgcca ccgggacgga 
ctcgcagtca ttcctgcggc ttgcgcgccc 
cttgttttct gcaggtgcag gcctggggta 
gcaggacact ggctcagtag tgcctttgca 
tgcttctggt gggatcattg gctatgtaaa 
gctgctcttt ggcagtctag ccggcctggg 
cgtttgggtt ttcctagcta catctggtac 
ccactctgga aaattcatgc ctgcaggttt 
caaagttgga gttagtatgt tcaacagacc 
ctgatgaaga attaaaaatc tgcatcttcc 
gcagcatttt tgcatctgac attttaccta 
tggaaaatca gtcatgatta caaacctaca 
ataagaccct catagagctt gattcttgta 
ccactatttt caatatatta agagaaataa 
taaaaaaaaa gacaccaaac ttggcagaga 
cagaggtggc gagtatgtaa cacaagagct 
tatattgatg ttgtcttttc tttctgtatc 
gtgtcagctt tcagggctct gaaaccccat 
gtcttttagg agatttacaa tatctgttct 
gaaattatgt taagtgaaat atcaatgaaa 
aaa 



ggtacactgc gcaggcacaa cagagccgct 60 
gagcgcccgc cgctgcattt ccggcgacac 120
ttgtagacag ccggggcctt cgtgagaccg 180 
gtctcctgtc tggacagaga agagaaaaat 240 
ttggtttggc tttggctacg cagcactggt 300 
agcaggcagc gtgccgtccc tggctgcagg 360
tgcttaccag ctgtctcagg atccaaggaa 420 
cttggctggc attatgggaa tgaggttcta 480 
aattgcaggt gccagtttgc tgatggtcgc 540 
ccattagcag aagtcatgtt ccagcttaga 600 
actattttca atatattaag agaaataagt 660 
aaaaaaaaga caccaaactt ggcagagagg 720 
gaggtggcga gtatgtaaca caagagctta 780 
tattgatgaa gaattaaaaa tctgcatctt 840 
gtgcagcatt tttgcatctg acattttacc 900 
ggtggaaaat cagtcatgat tacaaaccta 960 
taataagacc ctcatagagc ttgattcttg 1020 
tgtaggtaaa tctcaagggt aaaatgttag 1080 
tccctgctct gaggaacagt gtgaaaaaaa 1140 
tttgctcatc ttagaccaca gactgacttt 1200 
ataaagttta ctataaataa taaaaaaaaa 1260 

1263 



<210> 38 
<211> 978 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No.: 1923127 

<400> 38 

ctcgagccgc gcggcccc^g ggcgcacgcg 
tgctgtgtcc cgcggcttgc gctccgtagt 
ctggggtagt ctcctttctg gactgagaag 
gtgcctttgc attggtttgg ctttggctac 
ggctatgtaa aaacaggcag cgtgccgtcc 
gccggcctgg gtgcttacca gctgtatcag 
gctacatctg ttacttttgt tggtgttatg 
atgcctgtag gtttaattgc aggtgccagt 
atgttgatga catctgatta gcagaagtca 
aatctgcatc ttccactatt ttcaatgtat 
tgacatttta cctaaaaaaa aaaagacacc 
ttaccattat aaccctacag aggtggtgag 
agagatcgat tcttgtatat tgattttatc 
gtaaaatgtt aggtgttgac attgagaacc 
gtgtgaaaaa aaatctcttg agagatttag 
cagactgact ttgaaattat gttaagtgaa 
aataaaaaaa aaaaaaaa 



cacgcaatcg cgtttccgga gagacctggc 60 
ggactccgcg ggccttcggc agatgcaggc 120 
agaagaatgg agaagcccct cttcccatta 180 
acagcactgg ttgtttctgg tgggatcgtt 240 
ctggctgcag ggctgctctt cggcagtcta 300 
gatccaagga acgtttgggg tttcctagcc 360 
ggaatgagat cctactacta tggaaaattc 420
ttgctgatgg ccgccaaagt tggagttcgt 480 
tgttccagct tggactcatg aaggattaaa 540 
taagagaaat aagtgcagca tttttgcatc 600 
aaatttggcg gaggggtgga aaatcagttg 660 
catgtaacat gagcttattg agaccatcat 720 
tctttctgta tctataggta aatctcaagg 780 
ctgaaacccc attccctgct cagaggaaca 840 
aatatctttt cttttgctca tcttagacca 900 
atatcaatga aaataaagtt tactataaat 960 

978 



<210> 39 
<211> 851 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 3123954
<400> 39

cggcacgcgt ggggtccgcg cgtgcgcacc 
tcccgcggtc ttctctgcaa atgggctccg 
gatcgtgcgc cgaggcccgc gaggggtcgc 
gcatggcggc catcccctcc agcggctcgc 
gcctgggttc cacttccagc aacagctcct 
ttccccaccc cccaggtctc cccaaggctg 
tcgggaagtc caccctcccg ttcatggcca 
ctccccaggc ctccagcagc atgaccgcct 
agcccggcgg tcagtccagc acagccagcg 
caccagcccc aggcctgcgg aggcgctagt 
aatccgcatc cctcgccccc ctccccacct 
gcagcaagca gaccttcgca tcaacacagc 
tctaccgccc ggccccagca ctcgctagct 
ccaagcggaa aataaactcc aagcagccag 
aaaaaaaaag g 



ccgcgcgcgc ctctctgtcg tggcgcggct 60 
tggcctagcg cccccgtccc cgccacccgt 120 
cgcccagatc ccaccagcca gcaagctaaa 180 
tcgtggccac ccacgactac taccggcgcc 240 
gcagcagtac cgagtgcccc ggggaagcca 300 
acccgggtca ttggtgggcc agcttctttt 360 
cggtgttgga gtccgcagag cactcggaac 420 
gtggcctggc tcgggacgcc ccgaggaagc 480 
ctgggccccc gtcctgacct gagcggttac 540 
ccaccagagc ccctccccgc ccctntcccn 600 
cccacccccc accctgtaaa ctaggcggct 660 
agacaccaaa aaccagtgag agccccgctc 720 
ttcctgacac ctggaactgt gcacctggca 780 
taaaaaaaaa aaaaaaaaaa aaaaaaaaaa 840 

851 



<210> 40 

<211> 1907 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No.: 1321844



<400> 40 

tggaccgacg ggcgcaccca ggtagggggg 
gggaactgcg ccgccgccac catgtctcag 
ggcctccggg agaagtttca tgggaaggta 
aaattcggca gcgaccacac gggagtgggg 
gatggccaag aactaagtaa cgatctggat 
gaccgggaca ttcaggcagt ggcgacctcc 
atgtttcaac gtgcccagga cgaccttatc 
tcctgcgacc acgtctggtg gcgccgcgtg 
aaatgccgga agcgctacga gccagtgcca 
cactgcccga agtgtcggca caacttccgg 
tgctacgggt gcggcttccc cgtgtatcca 
gacccggatc gccgcagcac ccacactcac 
cgagagcccc acgtgcctgg gacatcctgt 
ctgcccaaag tgctccaccc cagcaaccct 
tgcttgagcc agggtggcct cctggaagac 
gaggaggagg aggaagagga ggaggtggag 
ctgccaggtg cagatacaaa ccagacacgg 
tgagctcaaa ccgagatatg aatgaccttg 
gggggattcc tgggtcccat tttcagcgcc 
gtgggacaca ttggcactga gccacaaaga 
tgaccaatgt cctctagggc ctaggggaca 
gaatgagcat ggcggccttc ctgagagaat 
caccatcctg gtagcggctt cggtagtggc 
agtgggcaca ggcatggtac caccagcctc 
ggagagagag agtccatgtc ctctcaccag 
caaggctttt gccaggcatc ccctggcccc 
ttccatctcc tcagcaaaaa aataggagcc 
ccttaattcc ttccctatct ccttaccaaa 
caaactagga gtctaccgtt cattccttta 
ataagagtac tagctctcac cctctgccct 
tgcctcatca gagaaggaat ctggactccc 
tttgttctct tgacggaata aaagcttgct 



cggctgagcc gcgcagtgcg gaccctcgcg 60 
gaaggtgtgg agctggagaa gagcgtccgg 120 
tcctccaaga aggcgggggc tctgatgagg 180 
cgctccatcg tgtacggggt aaagcaaaaa 240 
gcccaggatc caccagaaga tatgaagcag 300
ctcctgccac tgacagaagc caacctacgc 360 
cctgctgtgg accggcagtt tgcctgctcc 420 
ccccagcgga aggaggtatc ccggtgccgg 480 
gctgacaaga tgtggggcct ggctgagttc 540 
ggctgggcac agatggggtc cccgtccccc 600 
acacggatcc tccccccgcg ctgggaccgg 660 
tcctgctcag ctgccgactg ctacaaccgg 720 
gctcacccca agagccggaa gcagaaccac 780 
cacattagca gtggctccac tgtggccacc 840 
ctggacaacc tcatcctgga ggacctgaag 900 
gacgaggagg gcgggcccag ggagtgaccc 960 
tctgtggcta ctttgtgtta ttataagata 1020 
gggagccatc tgaggccaag atattgacgg 1080 
cagggtcaca gatccacagt gggaagttct 1140 
aggtgtggcc agaacaactt gggctcctgc 1200 
gaggaacaca gagtcacagc ttcaggggcc 1260 
atgccccacc acgaaactca gcccagtaga 1320 
cgccgtggtg ccacacaccg ttgaggttgg 1380 
cccgctggta cagggcacag ttacctgagg 1440 
aataaaagcc tctacctgca cctcacagtg 1500 
tcccattctt attgaataca agccctgatc 1560 
ctggcccccc aactttcttc agagtaatag 1620 
gtacaagtca catctttccc accttttctg 1680 
tcaaagaaaa gtatctactt cctttctaga 1740 
ttacttgaac aggagtcttg attctttttt 1800 
catcccccca ccaggataaa agtcctgacc 1860 
tatccttaaa aaaaaaa 1907 



<210> 41 
<211> 3689 
<212> DNA 
<213> Homo sapiens
<220> 

<221> misc_feature 

<223> Incyte ID No.: 375724.9 

<400> 41 

tgggtccccc aggagagcct ctaaggtcac 
gcatggtgcc tcctccctga caaccacctc 
ggcaccagca tggatctctg cctgctgccg 
atcagttctt tataccaagt cctttgtgaa 
tgtccacctg gcttccaaga tagaagaggc 
gtttcaccgc cttcgacagc tgagagacaa 
gagtgtgagc gtaaccaaca cctggtccag 
ctaagacttc tcctctgctg tccaagcgct 
atccagtgca ggctctccac cgaaggctgg 
gacagttggc agcggctctg atgagcccga 
cctccgcacg atgctcccac gcgtccaact 
tggactcctt caaggaagct gcagctcggt 
tggaggaaga ggaagaggaa cccaagagga 
tgtgctcagc cagttcattt gagtttgcat 
agatttcttt aatcaaaagc gttttagtga 
gtgctttgca tgagagcagg ccaaggttga 
ttcatagcaa cacgtattgt ctgacattca 
gcctttctgt cctgtattta ctgtatttag 
tcttgagatt cacaggactg tctagctctg 
gtacgtttat ctagattttc tctagataat 
ggcattgcag ttcttcctgt gtttggcttc 
cgacgtcttc gtggcggttc cagccagaga 
cccggacgct ggagatccct ttgcccaatc 
ctgaagaaga aattcaggaa atctgcttaa 
ttgatctcac acacctggag ggtgaagtgg 
aaggcccaag cccggggcct gttgcctggg 
ttctctcctg cccccaagct ggtggaatcc 
ccactgtctg tgaagaacac caagaggagg 
agccccgtga acggcttgcc aaaggggcga 
cagagctact cgaggtcccc atcccgatca 
cggctccaca tctggtgggt ccaagttcgc 
caccgagaca ggccccccgc agcgctccct 
agtccaagga ctgcaagtac ccccagaagc 
cgttctcgaa gcaggtcact gggagcgggc 
tcattactac agagatcagc gacgagagcg 
ctatgagcgg gaccaccctg ggcacagcag 
tggtggccgc aagcccttcc ctggggagta 
tgccctttga cccccgggtg tgctctcagc 
gaaatgtctg tcgactggac cttggtggat 
gtgcttcaga ccggtctggg gtgcggcgca 
agcagctctg agggcagctc aatgaaaaag 
ggcatggcct ggtgctatcg gcagccgctc 
aagccaagaa agatgatttt tagaaccttt 
tttgcagtgt ttcacaggag aaagtggcct 
aaataaacat gtgtttaata caagttaaag 
gtcagcttaa aacttgtgta gggaatcctg 
tgttgaaagt cgtatatttt tatctgtgcg 
agaggttccc tatatgcgcg tatgacatgg 
atccggagcg ttatggacgt ggtaggtggt 
gtgtgcacag ccagtgcggc ccacttccgg 
ggattttgca tccaatcctg tgtgcctgcc 
cctctgccca tcaggatgac aggctccttc 
acacagaatg attccaaggt gctctcgttc 
tctgtgggcg agagaggcag ctgggagcag 
cacctgcact ctgagtgccc cccactgtca 
ggttttgggg actagggggc cccactcctg 
ggacatatag gccggggcta gtgggataac 
gaatcctgca ttagatagct ggtgggaccc 
ccagggaggg tgtccttcca gggagagcag 
ggaagtaaga gaaccagtaa agggccatac 



acagggtgcc cactgcagac aggctatagt 60 
cacttcacac cagccacagc aaggaacttt 120 
atggcatgac tgtcgaacag gtggtgttcc 180 
gcattccaca gagcatgtgt caatggcctg 240 
cccaagacgc atacgggacg tcatcaatgt 300
aaaataatcg ttatgtacct tcaggtgtta 360
acctcatggg tagcctctga gggtaagtga 420 
ttggtgcagg gacagcggca tcttcagcca 480 
ctctagactg gtgacccctt gttgaaatgg 540 
gaagaggcct gcccttgggt gcggagtctc 600 
tgcacccaag gggcttttcc ctcttccaag 660 
cagcagagaa ggggcctgcc gccagcgccc 720 
tggcttgtct cccagcagcc acaccggctt 780 
gtttctctgc actatggatt ttgagcattt 840 
ctccagtaga cattttcttt ctgaggcatc 900 
ggggaaaagt aaagttaaag tcggttctct 960 
gccagctttt ttttttctaa taatttctgt 1020 
aaaaagcagc tagaatattt ctccattaac 1080 
agtcctagca atagactcct tagaggagta 1140 
gcaggcggaa gacctgggtt ccccgggtgg 1200 
caggaattac atgaacgaca gccttcgcac 1260 
gcatcgcctg tgcctgcatt tatcttgctg 1320 
gtccccattg gtttcttttg tttggagcaa 1380 
agatcttgca gctttatgct cggaaaaagg 1440 
aaaaaagaaa gcacgctatc gaagagggca 1500 
ggcacacagg tgctggatgg tacctcgggg 1560 
cccaaagaag gtaaagggag caagccttcc 1620 
ctggagggcg ccaagaaagc caaggcggac 1680 
gagagtcgga gtcggagccg gagccgtgag 1740 
gcgtctgcct aagaggagga aaagtgacag 1800 
agagccgctc ccggagcagg agtgactccc 1860 
acaaaggctc tgagattcgg ggctcccgga 1920 
cacacaagtc tcggagcccg gagttcttcc 1980 
ggataatccg ggaaaataca agaagaaaag 2040 
ctcgaggtcg tatgaacgca caggccgtcg 2100 
gcatcggagg tgaggcgggg ttgcagtgac 2160 
cctgatggct gccctttgac ccccggtggc 2220 
gcaagtggtc ctagaacagg attctttttg 2280 
ttggaaatgg aactgaggga ccggtgacac 2340 
cacctgggcc cgtgcagggc tcagctcggc 2400 
tgaatgcaca cgcccttgtt ggcgtggcct 2460 
tccactcccc gactgatact caattacgtg 2520 
gcctatatta ggttgtactt atgtacatat 2580 
taactgcccc ttattctctc tccacgttgt 2640 
ctatgtatga aaactcagaa cttgaatccc 2700 
acttttaaaa tgtgagggta tttggatctg 2760 
gtgctgagtg caggccacca gctcctaaat 2820 
tgaataaaca caactctctc cactcaggac 2880 
cgttctgtgt gcttgtgaaa gtgtccaggc 2940 
gctccttgct ccctgctgta ctgaagtttt 3000 
cttctgccga agcttgtgag gggcctgagt 3060 
ctgcagggcc ataggaggga agttttggaa 3120 
ctgaggggga ctggtttgta acccatgaca 3180 
gacacttgga gggtcacccc acgggggtgg 3240 
tcagctgcct cttaccgtgg acacagtttt 3300 
gtggtaccgt ttggacttac tagggcagtg 3360
ggggagttac gcctgatgac ttttttgatg 3420 
ccccctcaga attggggaac tgaggagact 3480 
ctatgagggg ccccctagct tcctgtgcct 3540
acacctgtac ccaagagacc gctctccatt 3600
tgctttcttt ttttactaaa taattgtaaa atattattat gacataaaga accatttaag 3660
gccanaaana anaagactna naaaaaaag 3689 



<210> 42 
<211> 408 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No.: 375724.3 

<400> 42 

tttgtattaa acacatgttt atttacaacg 
ctttctcctg tgaaacactg caaaatatgt 
tctaaaaatc atctttcttg gcttcacgta 
ctgccgatag caccaggcca tgcaggccac 
attgagctgc cctcagagct gctgccgagc 
ccagaagacg tcggtgcgaa ggctgtcgtt 
ttggttacgc tcacactcta acacctgaag 



tggagagaga ataaggggca gttaaggcca 60 
acataagtac aacctaatat aggcaaaggt 120 
attgagtatc agtcggggag tggagagcgg 180 
gctaacaagg gcgtgtgcat tcactttttc 240 
tgagccctgc acgggcccag gtgtgcgccg 300 
catgtaattc catgaggtct ggaccaggtg 360 
gtacataacg attatctt 408 



<210> 43 
<211> 3136 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 1867333 

<400> 43 

cgacgccggc gtgatgtggc ttccgctggt 
cctctgcaaa gtttacttgg gactattctc 
tgtcaaacgg cccccagcgc ccctggtaac
acaagctttt tcagccaacc aagtgccgga 
ctttgggggc ctggctgcag ctgcaattct 
ggaacaacat accaaggcag ggggctgctg 
tgacacagga atccattaca ttgggcgtat 
ggaccagatc actgaagggc agctggactg
ggtactggaa gggcccaatg gccgaaagga
cattcagggc ctcaaggaga agtttccaca 
gctggttaag gtggtatcca gtggagcccc 
gcccgtggtt cagctcctcg acaggtgtgg 
agcatccacc cagagcctgg ctgaggtcct
ggcagtactc agctacatct tccccactta 
catgcacgcc ctgctggtca accactacat 
cagtgaaatt gccttccaca ccatccctgt 
aaaggccact gtgcagagtg tgttgctgga
gaagaagggg catgagctgg tgaacatcta 
gttcaacacc tatgaacacc tactgccggg
gcaactgggg acggtgcggc ccggcttagg
caccaaggaa gacctgcatc tgccgtccac 
ggaccaggcg atggagcgct acgtctccat 
tcttctcttc ttcgctttcc catcagccaa 
ccggtccacc atgatcatgc tcatacccac 
ggagctgaag ggaaagcggg gcagtgacta 
ctctatgtca gtggtcctga aactgttccc 
tgcaggatcc ccactcacca accagttcta
ggctgaccat gacctgggcc gcctgcaccc 
ccccatcccc aacctctatc tgacaggcca
cctgcaaggt gccctgctgt gcagcagcgc
taagaatctt gattctagga tccgggcaca 
agtcagagga atttgcccaa tggctggggc
tgcattagtt ccttgcacgt ataaagcact 



gctgctcctg gctgtgctgc tgctggccgt 60 
tggcagctcc ccgaatcctt tctccgaaga 120 
tgacaaggag gccaggaaga aggttctcaa 180
gaagctggat gtggtggtaa ttggcagtgg 240 
agctaaagct ggcaagcgag tcctggtgct 300
tcataccttt ggaaagaatg gccttgaatt 360
ggaagagggc agcattggcc gttttatctt 420
ggctcccctg tcctctcctt ttgacatcat 480 
gtaccccatg tacagtggag agaaagccta 540
ggaggaagct atcattgaca agtatataaa 600 
tcatgccatc ctgttgaaat tcctcccatt 660 
gctgctgact cgtttctctc cattccttca 720
gcagcagctg ggggcctcct ctgagctcca 780 
cggtgtcacc cccaaccaca gtgccttttc 840
gaaaggaggc ttttatcccc gagggggttc 900 
gattcagcgg gctgggggcg ctgtcctcac 960
ctcagctggg aaagcctgtg gtgtcagtgt 1020 
ttgccccatc gtggtctcca acgcaggact 1080
gaacgcccgc tgcctgccag gtgtgaagca 1140 
catgacctct gttttcatct gcctgcgagg 1200
caactactat gtttactatg acacggacat 1260
gcccagggaa gaggctgcgg aacacatccc 1320
agatccgacc tgggaggacc gattcccagg 1380 
tgcctacgag tggtttgagg agtggcaggc 1440
tgagaccttc aaaaactcct ttgtggaagc 1500 
acagctggag gggaaggtgg agagtgtgac 1560 
tctggctgct ccccgaggtg cctgctacgg 1620
ttgtgtgatg gcctccttga gggcccagag 1680
ggatatcttc acctgtggac tggtcggggc 1740
catcctgaag cggaacttgt actcagacct 1800
gaagaaaaag aattagttcc atcagggagg 1860 
atctcccttg acttacccat aatgtctttc 1920 
ctaatttggt tctgatgcct gaagagaggc 1980
ctagtttaaa tcacaattcc gaatctgggg
gtgagatctt tacgcctttt ataacatgcc 
gcttgatgtc tcatgacgag cggcgctctg 
atcaaagcga atattccatc tgtggataga 
tgggttcagt tctgtcctga ggcttctgct 
ctacactgtc aagggaaaag ggagactaat 
tggttgccat tccataggtt tggagagctc 
cttcagggga caggaaatgc ctgtgtctgg 
gcaggatcca tcagttagta gggtgcatgt 
cccgggtctg tcttccttat catcggggtg 
tcagtacctg agcctcaatc aagccttatc 
gaagggtgac atcaggagtc agggcatgga 
gcaggctgca gggcattcca gccaagggca 
taagggaggg aagtcacatc agaaaaggga 
tggcatttgc agttaattag cacatgtgag 
ggtttggaaa aatgactttt cagttatgtc 
gtagttcgtg ttaatgtaac attaataaat 
tttattttgg ttttttgttc aagaaaacaa 
tcagtaaaac actatatact gaataacacc 
aattataaaa aaaaaa 



caatggaatc actgcttcca gctggggcag 2040 
atccctacta ataggatatt gacttggata 2100 
catccctcac ccatgcctcc taactcagtg 2160 
acccctggca gtgttgtcag ctcaacctgg 2220 
ctcattcatt tagtgctacg ctgcacagtt 2280 
gaggcttaac tcaaaacctg ggcatggttt 2340 
tagatctctt ttgtgctggg ttcagtggct 2400 
ccagtgtggt tctggagctt tggggtaaca 2460 
cagatgatca tatccaattc atatggaagt 2520 
gcagctggtt ctcaatgtgc cagcagggac 2580 
caccaaatac acagggaagg gtgatgcagg 2640 
ctggtaagat gaatactttg ctgggctgaa 2700 
cagcagggga cagtgcaggg aggtgtgggg 2760 
aagccacgga atgtgtgtga agcccagaaa 2820 
ggttagacag gtaggtgaat gcaagctcaa 2880 
tttggtatca gacatacgaa aggtctcttt 2940 
ttattgattc cattgcttta acatttgaaa 3000 
aactattatt gtgatggcat ttgcagaagc 3060 
aaaataagct ttaaaaaaat aaaattaagt 3120 

3136 



<210> 44 

<211> 1376 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No.: 1461451 

<400> 44 

ccacgcgtcc gcggacggtg ggtcgcccac 
gatgagatcc cggcctcagg gtggacgcag 
gcgcctgtgt ggtacttggt agcggcggct 
cgcagccggg gccgggcggc atcagccggc 
ggagcaggcc gggtggccca gcctgggccc 
aggcctcggc gccggaggga cctgggcagc
gtggcctggg cagaagcaga tgagaacgag 
gaaggtgtcg agaagccagc ggaaactcac 
cggaagctgg aggagaaaca agcgcgaaag 
gaggagcgga aacgactcga gtcccagcgc 
cttcgcctgg aggaggagca gaaggaggag 
cagcgggagc atgaggagta cctgaaactg 
gtaggagaga ccatgactga ggaacagtcc 
atcaagcagt ccaaggttgt gctcttggaa 
caggacacca taaatcgcat ccaggacctg 
gacgaccggg gcaagttcat ctacataacc 
atccgacagc ggggccgggt gtccatcgcc 
gcctggggcc gggagtcccc tgcccaagcc 
ctcagagttg gtgtggccta cctggctata 
agtgatggtg tggccaggca gttatagatt 
tggcttggtg tggcagaagg cctggcctag 
tcagaatata tccgagaggt ggggagggtc 
tatgaatcca ttcattcaag aaaatagcct 



<210> 45 
<211> 649 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 2345712 

<400> 45 



gcgtccgccc acgcgtccgc ccacgcgtcc 60 
tggttctgca ctgaggccct cgtcatggtg 120 
ctgctagtcg gctttatcct cttcctgact 180 
caagagccac tgcacaatga ggagctggca 240 
ctggagcctg aggagccgag agctggaggc 300 
cgcctacagg cccagcgtcg agcccagcgg 360 
gaggaagctg tcatcctagc ccaggaggag 420 
ctgtcgggga aaattggagc taagaaactg 480 
gcccagcgtg aggcagagga ggctgaacgt 540 
gaagctgagt ggaagaagga ggaggagcgg 600 
gaggagagga aggcccgcga ggagcaggcc 660 
aaggaggcct ttgtggtgga ggaggaaggc 720 
cagagcttcc tgacagagtt catcaactac 780 
gacctggctt cccaggtggg cctacgcact 840 
ctggctgagg ggactataac aggtgtgatt 900 
ccagaggaac tggccgccgt ggccaacttc 960 
gagcttgccc aagccagcaa ctccctcatc 1020 
ccagcctgac cccagtcctt ccctcttgga 1080 
catcttcatc cctccccacc atcctgggga 1140 
aaaggcctgt gagtactgct gagcttggtg 1200 
gatcctagat aagcaggtga aatttaggct 1260 
ccttggaagc tggtgaagtc ctgttcttat 1320 
gttgcacatt taaaaaaaaa aaaaaa 1376 
ctacgacccg attggcttcg ggctcagctg
acccacgagg ggacgcggcc gaggagggtc 
accagattca gcccatttgg ccccgacgcc 
gaggtcccgg ttcctaacgg actgcaagat 
taagatggtc catctactgg tcttgtcagg 
cgtctcaggc ttcctgcttt tccgaagcct 
caaactcttc cccttctact tccacatctc 
cttggcttca cagcatgctt gggctcagct 
gctgttcctg agccttacgc tggccactgt 
agctgccatg tgggccctgc aaaccgtgga 
aggcagccac cagggttccg atccctaccg 



ggaggcggga cgaattattg gttgggggaa 60 
gctgtccacc cgggggcgtg ggagtgaggt 120 
tctgttctcg gaatccgggt gctgcggatt 180 
ggaggaaggc gggaacctag gaggcctgat 240 
tgcctggggc atgcaaatgt gggtgacctt 300 
tccccgacat accttcggac tagtgcagag 360
catgggctgt gccttcatca acctctgcat 420 
cacattctgg gaggccagcc agctttacct 480 
caacgcccgc tggctggaac cccgcaccac 540 
gaaggagcga ggcctgggtg gggaggtacc 600
ccagctgcga gagaaggac 649 



<210> 46 
<211> 1554 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No.: 1810320 

<400> 46 

ctcccggttc caggcgagtt cgcagctgcg 
gcccgttgtc cccgcagtcc ccgacgggag 
gggggacgac tgtctccccg cgtaccgcca 
agtggccttc atcacaggag gcggctctgg 
gcggcacggc tgccatacgg tgattgccag 
caggaagctg gctggggcca ccggccggcg 
gcccccagct gtcatggccg ccgtggacca 
tctcattaac tgtgcggccg ggaacttcct 
cttcaagacc gtgatggaca tcgataccag 
tgagaagttc ttccgggacc acggaggggt 
ccgggggcag gcgctccagg tgcatgcagg 
gcggcacttg gctgtggagt ggggtcccca 
ccccatcagt ggcacagagg ggctccggcg 
caaggtcact gccagcccgc tgcagaggct 
gctctacctg gccagccctc tggcttccta 
cggggcatgg ttgacgttcc caaacggtgt 
tgctaagctc taggaatctt ccggccgctg 
agcaccaatc tgaaccagca atgcctgcag 
actgcgcttt ccctccccac ggccccaact 
ggcccgtgga gctgccacgc aggtgcctga 
caggtgccac gcaggtggtg ggggtacaga 
tcaagggtgt cctgcctgcc tgggtccagg 
tgtgttctct tggctgaaaa cactgaggtg 
atggtcctcc agctgcccac aaggtccgcc 
acacactttg ctacaatctt gctagtgcgt 
aataaattgg actttgcaaa agcttttaga 



cgccgggtcc tggaggccga ggccgctccc 60 
cgccatggcc cagccgccgc ccgacgtgga 120 
cctcttctgc ccggacctgc tgcgggacaa 180 
gattgggttc cggattgctg agattttcat 240 
taggagcctg ccgcgagtgc tgacggccgc 300 
ctgcctccct ctctctatgg acgtccgagc 360
ggctctgaag gagtttggca gaatcgacat 420 
gtgccccgct ggcgccttgt ccttcaacgc 480 
cggcaccttc aatgtgtctc gtgtgctcta 540 
gatcgtgaac atcactgcca ccctggggaa 600 
ctccgccaag gccgctgtgg acgcgatgac 660 
aaacatccgc gtcaacagcc tcgcccctgg 720 
actgggtggc cctcaggcca gcctgagcac 780 
ggggaacaag accgagatcg cccacagcgt 840 
cgtgacgggg gccgtgctgg tggccgatgg 900 
caaagggctg ccggatttcg catccttctc 960 
cttcctgccg cctcactcag ccaggtggag 1020 
cccagcccct cctctgaaca ctcagctatt 1080 
ccagggcagg agcaactgga cagtgggcct 1140 
gggccaggtg ccacgcaggt gtctgaggac 1200 
caagatgctg ggatgtcccc tgccccatgg 1260 
gcctgaggga gccacatgga tcccgagact 1320 
ctcccatctg tgcgtggccc atgagctggg 1380 
cctctgtctc tgcaccacct gtttgcataa 1440 
tttcttaaaa gataatctat ttactgtaaa 1500 
aggaaaagaa agaggattaa aggg 1554 



<210> 47 

<211> 1083 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 964996 

<400> 47 

gagccgtcag tcttacaaag tcgtgactgg 
tgcagcacat gcctctgacc gccttcggca 
gggctcctct ggaaaccagg gacccgatgc 
ggactatagg aaggagggcc ctcctggacc 
gatggcgtgc aacggcggct cgttggggac 



caaaacctgg cgttaccaac ttaatcgcct 60 
cgtccagatt ctgtgggaca tacagggtct 120 
cggagggtag cttggctctg gagcagcctg 180 
cgggagcgga ccctggtggc ggtgaagccc 240 
gtgatccagc gctttgagag gcggggcttc 300 
acgctggtgg ggatgaagat gctgcaggca
gacctgcgga ggaagccctt ctaccctgcc 
gtggccatgg tctgggaagg gtacaatgtc 
accgactcgg ctgaggctgc cccaggaacc 
aggaatgtca tccacgccag cgactccgtg
ttccagagca gtgagctggt gagctgggca 
gcctgaggct caagctgccc ttaccacccc 
agcaagaacc caagcccaca tccaaacctg 
cctctgcccc accccagccc agaggagttt 
ccaagccagc acaagattgg accaatcctt 
gtgggggggg gtcttcacat tatcataacc 
ctgtgcccag cacatgggtg gtacactaat 
gacgccttta tgcaagttgt aaggagttga 
aaa 



ccagagagcg tccttgccga gcactaccag 360 
ctcatccgct acatgagctc tgggcctgtg 420 
gtccgcgcct cgagggccat gattggacac 480 
ataaggggtt acttcagcgt ccacatcagc 540 
gagggggccc agcgggagat ccagctgtgg 600 
gatgggggcc agcacagcag catccaccca 660 
atcccccacg caggaccaac tacctccgtc 720 
cctgtcccaa accacttact tccctgttca 780 
gagccaccaa cttcagtgcc tttctgtacc 840 
tttgcaccaa agtgccggac aacctttgtg 900 
tctcctctaa aggggaggca ttaaaattca 960 
tatgacttcc cccagctctg aggtagaaat 1020 
acagtaaaga ggaagttttg cacaaaaaaa 1080 

1083 



<210> 48 
<211> 1512 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No.: 2302721 

<400> 48 



Met 


Ala 


Thr 


Leu 


Ser 


Leu 


Thr 


Val Asn Ser Gly Asp Pro Pro Leu 


1 








5 






10 15 


Gly Ala 


Leu 


Leu 


Ala 


Val 


Glu 


His Val Lys Asp Asp Val Ser Ile










20 






25 30 


Ser 


Val 


Glu 


Glu 


Gly 


Lys 


Glu 


Asn Ile Leu His Val Ser Glu Asn










35 




40 45 


Val 


Ile


Phe 


Thr 


Asp 


Val 


Asn 


Ser Ile Leu Arg Tyr Leu Ala Arg










50 






55 60 


Val 


Ala 


Thr 


Thr 


Ala 


Gly 


Leu 


Tyr Gly Ser Asn Leu Met Glu His 










65 




70 75 


Thr 


Glu 


Ile


Asp 


His 


Trp 


Leu 


Glu Phe Ser Ala Thr Lys Leu Ser 








80 




85 90 


Ser Cys 


Asp 


Ser 


Phe 


Thr 


Ser 


Thr Ile Asn Glu Leu Asn His Cys








95 






100 105 


Leu 


Ser 


Leu 


Arg 


Thr 


Tyr 


Leu 


Val Gly Asn Ser Leu Ser Leu Ala 








110 




115 120 


Asp 


Leu 


Cys 


Val 


Trp 


Ala 


Thr 


Leu Lys Gly Asn Ala Ala Trp Gln










125 






130 135 


Glu 


Gln


Leu 


Lys 


Gln


Lys 


Lys 


Ala Pro Val His Val Lys Arg Trp 










140 






145 150 


Phe Gly 


Phe 


Leu 


Glu 


Ala 


Gln


Gln Ala Phe Gln Ser Val Gly Thr










155 






160 165 


Lys 


Trp 


Asp 


Val 


Ser 


Thr 


Thr 


Lys Ala Arg Val Ala Pro Glu Lys 










170 






175 180 


Lys 


Gln


Asp 


Val 


Gly 


Lys 


Phe 


Val Glu Leu Pro Gly Ala Glu Met 








185 






190 195 


Gly Lys 


Val 


Thr 


Val 


Arg 


Phe 


Pro Pro Glu Ala Ser Gly Tyr Leu 










200 






205 210 


His 


Ile


Gly 


His 


Ala 


Lys 


Ala 


Ala Leu Leu Asn Gln His Tyr Gln










215 






220 225 


Val 


Asn 


Phe 


Lys 


Gly 


Lys 


Leu 


Ile Met Arg Phe Asp Asp Thr Asn










230 






235 240 


Pro 


Glu 


Lys 


Glu 


Lys 


Glu 


Asp 


Phe Glu Lys Val Ile Leu Glu Asp










245 






250 255 


Val 


Ala 


Met 


Leu 


His 


Ile


Lys 


Pro Asp Gln Phe Thr Tyr Thr Ser










260 




265 270 


Asp 


His 


Phe 


Glu 


Thr 


Ile


Met 


Lys Tyr Ala Glu Lys Leu Ile Gln








275 






280 285 


Glu Gly 


Lys 


Ala 


Tyr 


Val 


Asp 


Asp Thr Pro Ala Glu Gln Met Lys










290 






295 300 


Ala 


Glu 


Arg 


Glu 


Gln


Arg 


Ile


Glu Ser Lys His Arg Lys Asn Pro 
305



Ile


Glu 


Lys Asn 


Leu 


Gln


Met Trp 










320 






Gln


Phe 


Gly Gln


Ser 


Cys 


Cys Leu 










335 






Ser 


Asn 


Asn Gly 


Cys 


Met 


Arg Asp 










350 






Ile


Gln


Pro 


His 


Pro 


Arg 


Thr Gly 










365 






Thr 


Tyr 


Asp 


Phe 


Ala 


Cys 


Pro Ile










380 






Thr 


His 


Ala 


Leu 


Arg 


Thr 


Thr Glu 










395 






Phe 


Tyr 


Trp 


Ile


Ile


Glu 


Ala Leu 








410 






Trp Glu 


Tyr 


Ser 


Arg 


Leu 


Asn Leu 










425 






Arg 


Lys 


Leu 


Thr 


Trp 


Phe 


Val Asn 










440 






Asp Asp 


Pro 


Arg 


Phe 


Pro 


Thr Val 










455 






Met 


Thr 


Val 


Glu 


Gly 


Leu 


Lys Gln










470 






Ser 


Arg 


Ser 


Val 


Val 


Asn 


Met Glu 










485 






Asn 


Lys 


Lys 


Val 


Ile


Asp 


Pro Val 










500 






Leu 


Lys 


Lys 


Glu 


Val 


Ile


Pro Val 










515 






Glu 


Met 


Lys 


Glu 


Val 


Ala 


Lys His 










530 






Leu 


Lys 


Pro 


Val 


Trp 


Tyr 


Ser Pro 










545 






Asp 


Ala 


Glu 


Thr 


Phe 


Ser 


Glu Gly 










560 






Trp 


Gly 


Asn 


Leu 


Asn 


Ile


Thr Lys 








575 






Lys 


Ile


Ile


Ser 


Leu 


Asp 


Ala Lys 










590 






Tyr 


Lys 


Lys Thr 


Thr 


Lys 


Val Thr 










605 






Ala 


Leu 


Pro 


Ile


Pro 


Val 


Ile Cys










620 






Thr 


Lys 


Pro 


Val 


Leu 


Gly 


Lys Asp 










635 






Asn 


Lys 


Asn 


Ser 


Lys 


His 


Glu Glu 










650 






Leu 


Lys 


Asp Leu 


Lys 


Lys 


Gly Asp 










665 






Gly 


Phe 


Phe 


Ile


Cys 


Asp 


Gln Pro










680 






Ser 


Cys 


Lys 


Glu


Ala 


Pro 


Cys Val 








695 






His 


Thr 


Lys Glu 


Met 


Pro 


Thr Ser 










710 






Val 


Glu 


Ala 


Thr 


Lys 


Asn 


Glu Thr 










725 






Pro 


Thr 


Pro 


Ser


Leu 


Asn 


Asn Asn 










740 






Leu 


Val 


Leu 


Tyr 


Asn 


Arg 


Val Ala 










755 






Glu 


Leu 


Lys 


Ala 


Lys 


Lys 


Ala Pro 










770 






Val 


Lys 


Gln


Leu 


Leu 


Ser 


Leu Lys 










785 






Gly 


Gln


Glu Tyr 


Lys 


Pro 


Gly Asn 










800 






Asn 


Ile


Ser 


Ser 


Asn 


Ser 


Ser Ala 





310 










315 


Glu 


Glu 


Met 


Lys 


Lys 


Gly 


Ser 




325 










330 


Arg 


Ala 


Lys 


Ile


Asp 


Met 


Ser 




340 










345 


Pro 


Thr 


Leu 


Tyr 


Arg 


Cys 


Lys 




355 










360 


Asn 


Lys 


Tyr 


Asn 


Val 


Tyr 


Pro 




370 










375 


Val 


Asp 


Ser 


Ile


Glu


Gly 


Val 




385 










390 


Tyr 


His Asp Arg Asp 


Glu 


Gln




400 










405 


Gly 


Ile Arg Lys


Pro 


Tyr 


Ile




415 










420 


Asn 


Asn 


Thr 


Val 


Leu 


Ser 


Lys 




430 










435 


Glu Gly Leu Val Asp 


Gly 


Trp 




445 










450 


Arg Gly Val 


Leu 


Arg 


Arg 


Gly 




460 










465 


Phe 


Ile


Ala 


Ala 


Gln


Gly 


Ser 




475 










480 


Trp 


Asp 


Lys 


Ile


Trp 


Ala 


Phe 




490 










495 


Ala 


Pro 


Arg 


Tyr 


Val 


Ala 


Leu 




505 










510 


Asn 


Val 


Pro 


Glu 


Ala 


Gln


Glu




520 










525 


Pro 


Lys 


Asn 


Pro 


Glu 


Val 


Gly 




535










540 


Lys 


Val 


Phe 


Ile


Glu


Gly 


Ala 


550 










555 


Glu 


Met 


Val 


Thr 


Phe 


Ile


Asn 




565 










570 


Ile


His 


Lys 


Asn 


Ala 


Asp 


Gly 




580 










585 


Leu 


Asn 


Leu 


Glu 


Asn 


Lys 


Asp 




595 










600 


Trp 


Leu 


Ala 


Glu 


Thr 


Thr 


His 


610 










615 


Val 


Thr 


Tyr 


GlU 


His 


Leu 


He 




625 










630 


Glu Asp 


Phe 


Lys 


Gin 


Tyr 


Val 




640 










645 


Leu 


Met 


Leu Gly Asp 


Pro 


Cys 




655 










660 


He 


He 


Gin 


Leu 


Gin 


Arg 


Arg 




670 










675 


Tyr 


Glu 


Pro 


Val 


Ser 


Pro 


Tyr 


685 










690 


Leu 


He 


Tyr 


He 


Pro 


Asp 


Gly 




700 










705 


Gly 


Ser 


Lys 


Glu 


Lys 


Thr 


Lys 




715 










720 


Ser 


Ala 


Pro 


Phe 


Lys 


Glu 


Arg 




730 










735 


Cys 


Thr 


Thr 


Ser 


GlU 


Asp 


Ser 


745 










750 


Val 


Gin Gly Asp Val 


Val 


Arg 




760 










765 


Lys 


Glu Asp Val 


Asp 


Ala 


Ala 




775 










780 


Ala 


Glu 


Tyr 


Lys 


GlU 


Lys 


Thr 




790 










795 


Pro 


Pro 


Ala 


Glu 


He 


Gly 


Gin 




805 










810 


Ser 


He 


Leu 


Glu 


Ser 


Lys 


Ser 
815






820 


825 


Leu Tyr Asp Glu Val Ala Ala Gln Gly Glu Val Val Arg Lys Leu
830 835 840

Lys Ala Glu Lys Ser Pro Lys Ala Lys Ile Asn Glu Ala Val Glu
845 850 855

Cys Leu Leu Ser Leu Lys Ala Gln Tyr Lys Glu Lys Thr Gly Lys
860 865 870

Glu Tyr Ile Pro Gly Gln Pro Pro Leu Ser Gln Ser Ser Asp Ser
875 880 885

Ser Pro Thr Arg Asn Ser Glu Pro Ala Gly Leu Glu Thr Pro Glu
890 895 900

Ala Lys Val Leu Phe Asp Lys Val Ala Ser Gln Gly Glu Val Val
905 910 915

Arg Lys Leu Lys Thr Glu Lys Ala Pro Lys Asp Gln Val Asp Ile
920 925 930

Ala Val Gln Glu Leu Leu Gln Leu Lys Ala Gln Tyr Lys Ser Leu
935 940 945

Ile Gly Val Glu Tyr Lys Pro Val Ser Ala Thr Gly Ala Glu Asp
950 955 960

Lys Asp Lys Lys Lys Lys Glu Lys Glu Asn Lys Ser Glu Lys Gln
965 970 975

Asn Lys Pro Gln Lys Gln Asn Asp Gly Gln Arg Lys Asp Pro Ser
980 985 990

Lys Asn Gln Gly Gly Gly Leu Ser Ser Ser Gly Ala Gly Glu Gly
995 1000 1005

Gln Gly Pro Lys Lys Gln Thr Arg Leu Gly Leu Glu Ala Lys Lys
1010 1015 1020

Glu Glu Asn Leu Ala Asp Trp Tyr Ser Gln Val Ile Thr Lys Ser
1025 1030 1035

Glu Met Ile Glu Tyr His Asp Ile Ser Gly Cys Tyr Ile Leu Arg
1040 1045 1050

Pro Trp Ala Tyr Ala Ile Trp Glu Ala Ile Lys Asp Phe Phe Asp
1055 1060 1065

Ala Glu Ile Lys Lys Leu Gly Val Glu Asn Cys Tyr Phe Pro Met
1070 1075 1080

Phe Val Ser Gln Ser Ala Leu Glu Lys Glu Lys Thr His Val Ala
1085 1090 1095

Asp Phe Ala Pro Glu Val Ala Trp Val Thr Arg Ser Gly Lys Thr
1100 1105 1110

Glu Leu Ala Glu Pro Ile Ala Ile Arg Pro Thr Ser Glu Thr Val
1115 1120 1125

Met Tyr Pro Ala Tyr Ala Lys Trp Val Gln Ser His Arg Asp Leu
1130 1135 1140

Pro Ile Lys Leu Asn Gln Trp Cys Asn Val Val Arg Trp Glu Phe
1145 1150 1155

Lys His Pro Gln Pro Phe Leu Arg Thr Arg Glu Phe Leu Trp Gln
1160 1165 1170

Glu Gly His Ser Ala Phe Ala Thr Met Glu Glu Ala Ala Glu Glu
1175 1180 1185

Val Leu Gln Ile Leu Asp Leu Tyr Ala Gln Val Tyr Glu Glu Leu
1190 1195 1200

Leu Ala Ile Pro Val Val Lys Gly Arg Lys Thr Glu Lys Glu Lys
1205 1210 1215

Phe Ala Gly Gly Asp Tyr Thr Thr Thr Ile Glu Ala Phe Ile Ser
1220 1225 1230

Ala Ser Gly Arg Ala Ile Gln Gly Gly Thr Ser His His Leu Gly
1235 1240 1245

Gln Asn Phe Ser Lys Met Phe Glu Ile Val Phe Glu Asp Pro Lys
1250 1255 1260

Ile Pro Gly Glu Lys Gln Phe Ala Tyr Gln Asn Ser Trp Gly Leu
1265 1270 1275

Thr Thr Arg Thr Ile Gly Val Met Thr Met Val His Gly Asp Asn
1280 1285 1290

Met Gly Leu Val Leu Pro Pro Arg Val Ala Cys Val Gln Val Val
1295 1300 1305

Ile Ile Pro Cys Gly Ile Thr Asn Ala Leu Ser Glu Glu Asp Lys
1310 1315 1320

Glu Ala Leu Ile Ala Lys Cys Asn Asp Tyr Arg Arg Arg Leu Leu
1325 1330 1335

Ser Val Asn Ile Arg Val Arg Ala Asp Leu Arg Asp Asn Tyr Ser
1340 1345 1350

Pro Gly Trp Lys Phe Asn His Trp Glu Leu Lys Gly Val Pro Ile
1355 1360 1365

Arg Leu Glu Val Gly Pro Arg Asp Met Lys Ser Cys Gln Phe Val
1370 1375 1380

Ala Val Arg Arg Asp Thr Gly Glu Lys Leu Thr Val Ala Glu Asn
1385 1390 1395

Glu Ala Glu Thr Lys Leu Gln Ala Ile Leu Glu Asp Ile Gln Val
1400 1405 1410

Thr Leu Phe Thr Arg Ala Ser Glu Asp Leu Lys Thr His Met Val
1415 1420 1425

Val Ala Asn Thr Met Glu Asp Phe Gln Lys Ile Leu Asp Ser Gly
1430 1435 1440

Lys Ile Val Gln Ile Pro Phe Cys Gly Glu Ile Asp Cys Glu Asp
1445 1450 1455

Trp Ile Lys Lys Thr Thr Ala Arg Asp Gln Asp Leu Glu Pro Gly
1460 1465 1470

Ala Pro Ser Met Gly Ala Lys Ser Leu Cys Ile Pro Phe Lys Pro
1475 1480 1485

Leu Cys Glu Leu Gln Pro Gly Ala Lys Cys Val Cys Gly Lys Asn
1490 1495 1500

Pro Ala Lys Tyr Tyr Thr Leu Phe Gly Arg Ser Tyr
1505 1510









<210> 49 

<211> 238 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 2742442 

<400> 49 



Met Ala Ala Arg Thr Gly His Thr Ala Leu Arg Arg Val Val Ser
1 5 10 15

Gly Cys Arg Pro Lys Ser Ala Thr Ala Ala Gly Ala Gln Ala Pro
20 25 30

Val Arg Asn Gly Arg Tyr Leu Ala Ser Cys Gly Ile Leu Met Ser
35 40 45

Arg Thr Leu Pro Leu His Thr Ser Ile Leu Pro Lys Glu Ile Cys
50 55 60

Ala Arg Thr Phe Phe Lys Ile Thr Ala Pro Leu Ile Asn Lys Arg
65 70 75

Lys Glu Tyr Ser Glu Arg Arg Ile Leu Gly Tyr Ser Met Gln Glu
80 85 90

Met Tyr Asp Val Val Ser Gly Val Glu Asp Tyr Lys His Phe Val
95 100 105

Pro Trp Cys Lys Lys Ser Asp Val Ile Ser Lys Arg Ser Gly Tyr
110 115 120

Cys Lys Thr Arg Leu Glu Ile Gly Phe Pro Pro Val Leu Glu Arg
125 130 135

Tyr Thr Ser Val Val Thr Leu Val Lys Pro His Leu Val Lys Ala
140 145 150

Ser Cys Thr Asp Gly Arg Leu Phe Asn His Leu Glu Thr Ile Trp
155 160 165

Cys Phe Ser Pro Gly Leu Pro Gly Tyr Pro Arg Thr Cys Thr Leu
170 175 180

Asp Phe Ser Ile Ser Phe Glu Phe Arg Ser Leu Leu His Ser Gln
185 190 195

Leu Ala Thr Leu Phe Phe Asp Glu Val Val Lys Gln Met Val Ala
200 205 210

Ala Phe Glu Arg Arg Ala Cys Lys Leu Tyr Gly Pro Glu Thr Asn
215 220 225

Ile Pro Arg Glu Leu Met Leu His Glu Val His His Thr
230 235



<210> 50 

<211> 653 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No.: 3511087 

<400> 50 



Met 


Pro 


Phe 


Ser 


Ala 


Ser 


Leu 


Leu 


Gly Thr 


Leu Pro He Gly 


Ala 


1 








5 










10 




15 


Arg 


Tyr 


Ala 


Pro 


Pro 


Pro 


Ser 


Phe 


Ser 


Glu 


Phe Tyr Pro Pro 


Leu 






20 










25 




30 


Thr 


Ser 


Ser 


Leu 


Glu 


Asp 


Phe 


Cys 


Ser 


Ser 


Leu Asn Ser Phe 


Ser 










35 








40 




45 


Met 


Ser 


Glu 


Ser 


Lys 


Arg 


Asp 


Leu 


Ser 


Thr 


Ser Thr Ser Arg 


Glu 










50 










55 




60 


Gly Thr 


Pro 


Leu 


Asn 


Asn 


Ser 


Asn 


Ser 


Ser 


Leu Leu Leu Met 


Asn 










65 










70 




75 


Gly 


Pro 


Gly 


Ser 


Leu 


Phe 


Ala 


Ser 


Glu 


Asn 


Phe Leu Gly lie 


Ser 






80 










85 




90 


Ser 


Gin 


Pro 


Arg 


Asn 


Asp 


Phe 


Gly 


Asn 


Phe 


Phe Gly Ser Ala 


Val 








95 










100 




105 


Thr 


Lys 


Pro 


Ser 


Ser 


Ser 


Val 


Thr 


Pro 


Arg 


His Pro Leu Glu 


Gly 








110 










115 




120 


Thr 


His 


Glu 


Leu 


Arg 


Gin 


Ala 


Cys 


Gin 


He 


Cys Phe Val Lys 


Ser 










125 










130 




135 


Gly 


Pro 


Lys 


Leu 


Met 


Asp 


Phe 


Thr 


Tyr 


His 


Ala Asn He Asp 


His 










140 










145 




150 


Lys 


cys 


Lys 


Lys 


Asp 


He 


Leu 


He 


Gly Arg 


He Lys Asn Val 


Glu 










155 










160 




165 


Asp 


Lys 


Ser 


Trp 


Lys 


Lys 


lie 


Arg 


Pro 


Arg 


Pro Thr Lys Thr 


Asn 










170 










175 




180 


Tyr 


Glu 


Gly 


Pro 


Tyr 


Tyr 


He 


Cys 


Lys 


Asp 


Val Ala Ala Glu 


Glu 










lob 










190 




l y o 


Glu 


Cys 


Arg 


Tyr 


Ser 


Gly 


His 


Cys 


Thr 


Phe 


Ala Tyr Cys Gin 


Glu 








200 










205 




210 


Glu 


He 


Asp 


Val 


Trp 


Thr 


Leu 


Glu 


Arg 


Lys 


Gly Ala Phe Ser 


Arg 








215 










220 




225 


GlU 


Ala 


Phe 


Phe 


Gly 


Gly 


Asn 


Gly 


Lys 


He 


Asn Leu Thr Val 


Phe 










230 










235 




240 


Lys 


Leu 


Leu 


Gin 


GlU. 


His 


Leu 


Gly 


Glu 


Phe 


He Phe Leu cys 


Glu 








245 








250 




255 


Lys 


Cys 


Phe 


Asp 


His 


Lys 


Pro 


Arg 


Met 


He 


Ser Lys Arg Asn 


Lys 




260 










265 




270 


Asp Asn 


Ser 


Thr 


Ala 


Cys 


Ser 


His 


Pro 


Val 


Thr Lys His Glu 


Phe 










275 








280 




285 


Glu 


Asp 


Asn 


Lys 


Cys 


Leu 


Val 


His 


He 


Leu 


Arg Glu Thr Thr 


Val 






290 










295 




300 


Lys 


Tyr 


Ser 


Lys 


He 


Arg 


Ser 


Phe 


His 


Gly 


Gin Cys Gin Leu 


Asp 




305 










310 




315 


Leu 


Cys 


Arg 


His 


Glu 


Val 


Arg 


Tyr 


Gly Cys 


Leu Arg Glu Asp 


Glu 










320 










325 




330 


Cys 


Phe 


Tyr 


Ala 


His 


Ser 


Leu 


Val 


Glu 


Leu 


Lys Val Trp He 


Met 








335 










340 




345 


Gin 


Asn 


Glu 


Thr 


Gly 


He 


Ser 


His 


Asp Ala 


He Ala Gin Glu 


Ser 










350 










355 




3 60 


Lys 


Arg 


Tyr 


Trp 


Gin 


Asn 


Leu 


Glu 


Ala 


Asn 


Val Pro Gly Ala 


Gin 






365 










370 




375 


Val 


Leu 


Gly 


Asn 


Gin 


He 


Met 


Pro 


Gly 


Phe 


Leu Asn Met Lys 


He 
380



Lys 


Phe 


Val 


Cys 


Ala Gin Cys 
395 


Leu 


Arg 


Pro 


Asp 


Lys 


Asn 


Arg Lys Tyr 
410 


Cys 


Ser 


Trp Thr 


Lys 


Asp Arg Arg Ala 


Met 


Arg 










425 






Lys 


Lys 


Trp 


Met 


Asn He. Arg 
440 


Pro 


Leu 


Pro 


Leu 


Gin 


Phe Asp Leu Cys 


Asn 


His 










455 






Cys 


Gin 


Tyr 


Val Gly Asn Cys 


Ser 


Phe 










470 






Arg 


Glu 


Val 


Trp Thr Tyr Met 


Lys 


Glu 










485 






Glu 


Gin 


Phe 


Tyr 


Glu Leu Trp 
500 


Leu 


Lys 


Ser 


Glu 


Asp 


He 


Ala Ser Gin 


Ser 


Asn 








515 






He 


His 


Met 


Pro 


Thr Asp Tyr 
530 


Ala 


Glu 


Cys 


Trp 


Met 


Cys 


Gly Lys Asn 
545 


Cys 


Asn 


Gly His 


He 


Ser 


ser Glu Lys 


His 


Lys 










560 






Glu Asp 


Asp 


Gin 


Tyr Cys Trp 


Gin 


His 










575 






Phe 


Ser 


He 


Cys 


Asp Arg Tyr 
590 


Met 


Asn 


Asn 


Ser 


Cys 


Lys 


Phe Ala His 


Gly Asn 










605 






Glu 


Glu 


Arg 


Arg Asp Ala Leu 


Lys 


Met 










620 






Lys 


Asp 


His 


Leu 


He Gly Pro 


Asn 


Asp 








635 






Ser 


Phe 


Leu 


Phe 


Lys Asp Leu 


Asn 





650 
385




390 


Asn Gly Gin Val He 


Glu 


400 




405 


Ala Lys 


Ala Arg His 


Ser 


415 




420 


Val Met 


Ser He Glu 


Arg 


430 




435 


Pro Thr 


Lys Lys Gin 


Met 


445 




450 


He Ala 


Ser Gly Lys 


Lys 


460 




465 


Ala His 


Ser Pro Glu 


Glu 


475 




480 


Asn Gly 


He Gin Asp Met 


490 




495 


Ser Gin 


Lys Asn Glu 


Lys 


505 




510 


Lys Glu Asn Gly Lys 


Gin 


520 




525 


Val Thr 


Val Asp Phe 


His 


535 




540 


Ser Glu Lys Gin Trp Gin 


550 




555 


Glu Lys 


Val Phe His 


Thr 


565 




570 


Arg Phe 


Pro Thr Gly Tyr 


580 




585 


Gly Thr Cys Pro Glu Gly 


595 




600 


Ala Glu 


Leu His Glu 


Trp 


610 




615 


Lys Leu Asn Lys Ala 


Arg 


625 




630 


Asn Asp Phe Gly Lys 


Tyr 


640 




645 



<210> 51 

<211> 112 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No.: 1968009 

<400> 51 



Met Gln Asp Thr Gly Ser Val Val Pro Leu His Trp Phe Gly Phe
1 5 10 15

Gly Tyr Ala Ala Leu Val Ala Ser Gly Gly Ile Ile Gly Tyr Val
20 25 30

Lys Ala Gly Ser Val Pro Ser Leu Ala Ala Gly Leu Leu Phe Gly
35 40 45

Ser Leu Ala Gly Leu Gly Ala Tyr Gln Leu Ser Gln Asp Pro Arg
50 55 60

Asn Val Trp Val Phe Leu Ala Thr Ser Gly Thr Leu Ala Gly Ile
65 70 75

Met Gly Met Arg Phe Tyr His Ser Gly Lys Phe Met Pro Ala Gly
80 85 90

Leu Ile Ala Gly Ala Ser Leu Leu Met Val Ala Lys Val Gly Val
95 100 105

Ser Met Phe Asn Arg Pro His
110

<210> 52
<211> 114 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 1923127 

<400> 52 

Met Glu Lys Pro Leu Phe Pro Leu Val Pro Leu His Trp Phe Gly
1 5 10 15

Phe Gly Tyr Thr Ala Leu Val Val Ser Gly Gly Ile Val Gly Tyr
20 25 30

Val Lys Thr Gly Ser Val Pro Ser Leu Ala Ala Gly Leu Leu Phe
35 40 45

Gly Ser Leu Ala Gly Leu Gly Ala Tyr Gln Leu Tyr Gln Asp Pro
50 55 60

Arg Asn Val Trp Gly Phe Leu Ala Ala Thr Ser Val Thr Phe Val
65 70 75

Gly Val Met Gly Met Arg Ser Tyr Tyr Tyr Gly Lys Phe Met Pro
80 85 90

Val Gly Leu Ile Ala Gly Ala Ser Leu Leu Met Ala Ala Lys Val
95 100 105

Gly Val Arg Met Leu Met Thr Ser Asp
110



<210> 53 
<211> 114 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No.: 3123954 

<400> 53 



Met Ala Ala Ile Pro Ser Ser Gly Ser Leu Val Ala Thr His Asp
1 5 10 15

Tyr Tyr Arg Arg Arg Leu Gly Ser Thr Ser Ser Asn Ser Ser Cys
20 25 30

Ser Ser Thr Glu Cys Pro Gly Glu Ala Ile Pro His Pro Pro Gly
35 40 45

Leu Pro Lys Ala Asp Pro Gly His Trp Trp Ala Ser Phe Phe Phe
50 55 60

Gly Lys Ser Thr Leu Pro Phe Met Ala Thr Val Leu Glu Ser Ala
65 70 75

Glu His Ser Glu Pro Pro Gln Ala Ser Ser Ser Met Thr Ala Cys
80 85 90

Gly Leu Ala Arg Asp Ala Pro Arg Lys Gln Pro Gly Gly Gln Ser
95 100 105

Ser Thr Ala Ser Ala Gly Pro Pro Ser
110



<210> 54 
<211> 291
<212> PRT 

<213> Homo sapiens 

<220> 
<221> 
<223> 
<400> 54












Met Ser 


V7 -Lll VjJ.lL 


Gly 


Val 


Glu 


Leu 


Glu 


i 




5 














His 


Gly 


Lys 


Val 


Ser 






20 










U C U 116 V- 


At"(T TiVQ 

y Jf o 


Phe 


Gly 


Ser 


Asp 


His 






35 










He Val 


Tyr Gly 


Val 


Lys 


Gin 


Lys 


Asp 






50 












Asi3 Ala 


Gin 


Asp 


Pro 


Pro 


GlU 






65 










Asp He 


Gin Ala 


Val 


Ala 


Thr 


Ser 


Leu 






80 










noi i ijcu 


Attt Mot" 

fl<L y 11C l» 


Phe 


Gin 


Arg 


Ala 


Gin 






95 










Val Asp 


A T*rr n 1 ti 
r\. L. y ulii 


Phe 


Ala 


Cys 


Ser 


Ser 






110 










iii. y fix y 


Val Pro 


Gin 


Arg 


Lys 


Glu 


Val 






125 










rti y i_>j/ o 




Glu 


Pro 


Val 


Pro 


Ala 






140 










fiJ.Cl -L U 


IT alo 


Cys 


Pro 


Lys 


Cys 


Arg 






155 










Ala Vjin 


wet t»iy 


Ser 


Pro 


Ser 


Pro 


Cys 






170 










val iyr 


r/iO -L 111. 


Arg 


He 


Leu 


Pro 


Pro 




185 










Asp ATy 


Arg Ser 


Thr 


His 


Thr 


His 


Ser 






200 










Tyr Asn 


Arg Arg 


Glu 


Pro 


His 


Val 


Pro 






215 










Pro Lys 


Ser Arg 


Lys 


Gin 


Asn 


His 


Leu 






230 










Ser Asn 


Pro His 


He 


Ser 


Ser 


Gly 


Ser 






245 










Ser Gin 


Gly Gly 


Leu 


Leu 


Glu 


Asp 


Leu 






260 










Asp Leu 


Lys Glu 


Glu 


Glu 


Glu 


Glu 


Glu 






275 










Glu Gly 


Gly Pro 


Arg 


Glu 









290 



<210> 55 

<211> 610 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No.: 1867333 

<400> 55 



Met 
1 


Trp 


Leu 


Pro 


Leu 
5 


Val 


Leu 


Leu 


Leu 


val 


Leu 


Cys 


Lys 


Val 


Tyr 


Leu Gly Leu 










20 










Asn 


Pro 


Phe 


Ser 


Glu 


Asp 


Val 


Lys 


Arg 










35 










Thr 


Asp 


Lys 


Glu 


Ala 


Arg 


Lys 


Lys 


Val 










50 










Ala 


Asn 


Gin 


Val 


Pro 


Glu 


Lys 


Leu 


Asp 










65 










Gly 


Phe 


Gly 


Gly 


Leu 


Ala 


Ala 


Ala 


Ala 










80 










Lys 


Arg 


Val 


Leu 


Val 


Leu 


Glu 


Gin 


His 



2 
Lys


ser Val Arg Gly 


Leu 


10 




15 


Ser 


Lys Lys Ala Gly 


Ala 


25 




30 


Thr 


Gly Val Gly Arg 


Ser 


40 




45 


Gly 


Gin Glu Leu Ser 


Asn 


55 




60 


Asp 


Met Lys Gin Asp 


Arg 


70 




75 


Leu 


Pro Leu Thr Glu 


Ala 


85 




90 


Asp 


Asp Leu He Pro 


Ala 


100 




105 


cys 


Asp His Val Trp 


Trp 


115 




120 


Ser 


Arg Cys Arg Lys 


Cys 


130 




135 


Asp 


Lys Met Trp Gly 


Leu 


145 




150 


His 


Asn Phe Arg Gly 


Trp 


160 




165 


Tyr 


Gly Cys Gly Phe 


Pro 


175 




180 


Arg 


Trp Asp Arg Asp 


Pro 


190 




195 


Cys 


Ser Ala Ala Asp 


Cys 


205 




210 


Gly 


Thr Ser Cys Ala 


His 


220 




225 


Pro 


Lys Val Leu His 


Pro 


235 




240 


Thr 


Val Ala Thr Cys 


Leu 


250 




255 


Asp 


A^n Leu Tie Leu 


GlU 


265 




270 


Glu 


Glu Val Glu Asp 


Glu 


280 




285 


Ala 


Val Leu Leu Leu 


Ala 


10 




15 


Phe 


Ser Gly Ser Ser 


Pro 


25 




30 


Pro 


Pro Ala Pro Leu 


Val 


40 




45 


Leu 


Lys Gin Ala Phe 


Ser 


55 


60 


Val 


Val Val He Gly 


Ser 


70 




75 


He 


Leu Ala Lys Ala 


Gly 


85 




90 


Thr 


Lys Ala Gly Gly 


Cys 
95


100 


105 


Cys 


His 


Thr 


Phe 


Gly 


Lys Asn Gly Leu Glu Phe Asp Thr Gly 


He 










110 


115 


120 


His 


Tyr 


He 


Gly 


Arg 


Met Glu Glu Gly Ser He Gly Arg Phe 


He 






125 


130 


135 


Leu Asp 


Gin 


He 


Thr 


Glu Gly Gin Leu Asp Trp Ala Pro Leu 


Ser 










140 


145 


150 


Ser 


Pro 


Phe 


Asp 


He 


Met Val Leu Glu Gly Pro Asn Gly Arg 


Lys 








155 


160 


165 


Glu 


Tyr 


Pro 


Met 


Tyr 


Ser Gly Glu Lys Ala Tyr He Gin Gly 


Leu 








170 


175 


180 


Lys 


Glu 


Lys 


Phe 


Pro 


Gin Glu Glu Ala He He Asp Lys Tyr 


He 






185 


190 


195 


Lys 


Leu 


Val 


Lys 


Val 


Val Ser Ser Gly Ala Pro His Ala He 


Leu 






200 


205 


210 


Leu 


Lys 


Phe 


Leu 


Pro 


Leu Pro Val Val Gin Leu Leu Asp Arg 


Cys 








215 


220 


225 


Gly Leu 


Leu 


Thr 


Arg 


Phe Ser Pro Phe Leu Gin Ala Ser Thr 


Gin 










230 


235 


240 


Ser 


Leu 


Ala 


Glu 


Val 


Leu Gin Gin Leu Gly Ala Ser Ser Glu 


Leu 










245 


250 


255 


Gin 


Ala 


Val 


Leu 


Ser 


Tyr He Phe Pro Thr Tyr Gly Val Thr 


Pro 










260 


265 


270 


Asn 


His 


Ser 


Ala 


Phe 


Ser Met His Ala Leu Leu Val Asn His 


Tyr 










275 


280 


285 


Met 


Lys 


Gly 


Gly 


Phe 


Tyr Pro Arg Gly Gly Ser Ser Glu He 


Ala 






290 


295 


300 


Phe 


His 


Thr 


He 


Pro 


Val He Gin Arg Ala Gly Gly Ala Val 


Leu 










305 


310 


315 


Thr 


Lys 


Ala 


Thr 


Val 


Gin Ser Val Leu Leu Asp Ser Ala Gly 


Lys 








320 


325 


330 


Ala 


Cys 


Gly 


Val 


Ser 


Val Lys Lys Gly His Glu Leu Val Asn 


He 






335 


340 


345 


Tyr 


Cys 


Pro 


He 


Val 


Val Ser Asn Ala Gly Leu Phe Asn Thr 


Tyr 






350 


355 


360 


Glu 


His 


Leu 


Leu 


Pro 


Gly Asn Ala Arg Cys Leu Pro Gly Val 


Lys 










365 


370 


375 


Gin 


Gin 


Leu 


Gly 


Thr 


Val Arg Pro Gly Leu Gly Met Thr Ser 


Val 








380 


385 


390 


Phe 


He 


Cys 


Leu 


Arg 


Gly Thr Lys Glu Asp Leu His Leu Pro 


Ser 








395 


400 


405 


Thr 


Asn 


Tyr 


Tyr 


Val 


Tyr Tyr Asp Thr Asp Met Asp Gin Ala 


Met 










410 


415 


420 


Glu 


Arg 


Tyr 


Val 


Ser 


Met Pro Arg Glu Glu Ala Ala Glu His 


He 






425 


430 


435 


Pro 


Leu 


Leu 


Phe 


Phe 


Ala Phe Pro Ser Ala Lys Asp Pro Thr 


Trp 










440 


445 


450 


Glu 


Asp 


Arg 


Phe 


Pro 


Gly Arg Ser Thr Met He Met Leu He 


Pro 






455 


460 


465 


Thr 


Ala 


Tyr 


Glu 


Trp 


Phe Glu Glu Trp Gin Ala Glu Leu Lys 


Gly 








470 


475 


480 


Lys 


Arg 


Gly 


Ser 


Asp 


Tyr Glu Thr Phe Lys Asn Ser Phe Val 


Glu 








485 


490 


495 


Ala 


Ser 


Met 


Ser 


Val 


Val Leu Lys Leu Phe Pro Gin Leu Glu 


Gly 










500 


505 


510 


Lys 


Val 


Glu 


Ser 


Val 


Thr Ala Gly Ser Pro Leu Thr Asn Gin 


Phe 








515 


520 


525 


Tyr 


Leu 


Ala 


Ala 


Pro 


Arg Gly Ala Cys Tyr Gly Ala Asp His 


Asp 








530 


535 


540 


Leu Gly 


Arg 


Leu 


His 


Pro Cys Val Met Ala Ser Leu Arg Ala 


tain 








545 


550 


555 


Ser 


Pro 


He 


Pro 


Asn 


Leu Tyr Leu Thr Gly Gin Asp He Phe 


Thr 










560 


565 


570 


Cys Gly 


Leu 


Val 


Gly 


Ala Leu Gin Gly Ala Leu Leu Cys Ser 


Ser 










575 


580 


585 


Ala 


He 


Leu 


Lys 


Arg 


Asn Leu Tyr Ser Asp Leu Lys Asn Leu 


Asp 








590 


595 


600 


Ser 


Arg 


He 


Arg 


Ala 


Gin Lys Lys Lys Asn 
605



<210> 56
<211> 352
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No.: 1461451










py-A Afrr Val 


Arg Gly 


Arg Trp Val 


1 
J. 




5 




Qa-r Ala Hi c 
oex r\±cL n±s> 


Ala 


Ser 


Asp Glu He 






20 




lip xrllti V^y J? 


Thr 


Glu 


Ala Leu Val 






35 




Leu Val Ala 


Ala 


Ala 


Leu Leu Val 






50 




rlL y del y 


Gly Arg 


Ala Ala Ser 






65 




Ion Pin Pin 


Leu 


Ala 


Gly Ala Gly 






80 




Leu Glu Pro 


Glu 


Glu 


Pro Arg Ala 






95 




Arg Asp jjeu 


Gly 


Ser 


Arg Leu Gin 






110 




vai nia i rp 


Ala 


Glu 


Ala Asp Glu 






125 




Leu Ala Gin 


Glu 


Glu 


Glu Gly Val 






140 




T.on Cor 

JJGU Jcl uiy 


Lys 


He 


Gly Ala Lys 






155 




T A/C Pin Ala 
Liy o vj X 11 /\ x d 


Arg 


LyS' 


Ala Gin Arg 






170 




vj> X U. U1U niy 


Lys 


Arg 


Leu Glu Ser 






185 




T.ue P.1 ii fill! 


Glu 


Arg 


Leu Arg Leu 




200 




Glu Glu Arg 


Lys 


Ala 


Arg Glu Glu 






215 




Glu Tyr Leu 


Lys 


Leu 


Lys Glu Ala 






230 




Val Gly Glu 


Thr 


Met 


Thr Glu Glu 




245 




Glu Phe lie 


Asn 


Tyr 


He Lys Gin 






260 




Asp Leu Ala 


Ser 


Gin 


Val Gly Leu 






275 




Arg lie Gin 


Asp 


Leu 


Leu Ala Glu 




290 




Asp Asp Arg 


Gly Lys 


Phe He Tyr 






305 




Ala Val Ala 


Asn 


Phe 


He Arg Gin 






320 




Glu Leu Ala 


Gin 


Ala 


Ser Asn Ser 






335 




Ser Pro Ala 


Gin 


Ala 


Pro Ala 






350 





<210> 57 

<211> 216 

<212> PRT 

<213> Homo sapiens 



610 



Ala 


His 


Ala Ser Ala 


His 


Ala 




10 






15 


Pro 


Ala 


Ser Gly Trp 


Thr 


Gin 




25 






30 


Met 


Val 


Ala Pro Val 


Trp 


Tyr 




40 






45 


Gly 


Phe 


He Leu Phe 


Leu 


Thr 


55 






60 


Ala 


Gly 


Gin Glu Pro 


Leu 


His 




70 






75 


Arg 


Val 


Ala Gin Pro 


Gly Pro 




85 






90 


Gly 


Gly 


Arg Pro Arg 


Arg Arg 




100 






105 


Ala 


Gin 


Arg Arg Ala 


Gin Arg 




115 






120 


Asn 


Glu 


Glu Glu Ala 


Val 


He 




130 






135 


Glu 


Lys 


Pro Ala Glu 


Thr 


His 




145 






150 


Lys 


Leu 


Arg Lys Leu 


Glu 


Glu 




160 






165 


Glu 


Ala 


Glu Glu Ala 


Glu 


Arg 




175 






180 


Gin 


Arg 


Glu Ala Glu 


Trp 


Lys 




190 






195 


Glu 


Glu 


Glu Gin Lys 


Glu 


Glu 




205 






210 


Gin 


Ala 


Gin Arg Glu 


His 


Glu 




220 






225 


Phe 


Val 


Val Glu Glu 


GlU 


Gly 




235 






240 


Gin 


Ser 


Gin Ser Phe 


Leu 


Thr 




250 






255 


Ser 


Lys 


Val Val Leu 


Leu 


Glu 




265 






270 


Arg 


Thr 


Gin Asp Thr 


He 


Asn 




280 






285 


Gly 


Thr 


He Thr Gly 


Val 


He 




295 






300 


He 


Thr 


Pro Glu Glu 


Leu 


Ala 




310 






315 


Arg 


Gly 


Arg Val Ser 


He 


Ala 




325 






330 


Leu 


He 


Ala Trp Gly 


Arg 


Glu 




340 






345 
<220>

<221> misc_feature 

<223> Incyte ID No.: 2345712 



<400> 57 



Tyr Asp Pro Ile Gly Phe Gly Leu Ser Trp Glu Ala Gly Arg Ile
1 5 10 15

Ile Gly Trp Gly Lys Pro Thr Arg Gly Arg Gly Arg Gly Gly Ser
20 25 30

Leu Ser Thr Arg Gly Arg Gly Ser Glu Val Pro Asp Ser Ala His
35 40 45

Leu Ala Pro Thr Pro Leu Phe Ser Glu Ser Gly Cys Cys Gly Leu
50 55 60

Arg Ser Arg Phe Leu Thr Asp Cys Lys Met Glu Glu Gly Gly Asn
65 70 75

Leu Gly Gly Leu Ile Lys Met Val His Leu Leu Val Leu Ser Gly
80 85 90

Ala Trp Gly Met Gln Met Trp Val Thr Phe Val Ser Gly Phe Leu
95 100 105

Leu Phe Arg Ser Leu Pro Arg His Thr Phe Gly Leu Val Gln Ser
110 115 120

Lys Leu Phe Pro Phe Tyr Phe His Ile Ser Met Gly Cys Ala Phe
125 130 135

Ile Asn Leu Cys Ile Leu Ala Ser Gln His Ala Trp Ala Gln Leu
140 145 150

Thr Phe Trp Glu Ala Ser Gln Leu Tyr Leu Leu Phe Leu Ser Leu
155 160 165

Thr Leu Ala Thr Val Asn Ala Arg Trp Leu Glu Pro Arg Thr Thr
170 175 180

Ala Ala Met Trp Ala Leu Gln Thr Val Glu Lys Glu Arg Gly Leu
185 190 195

Gly Gly Glu Val Pro Gly Ser His Gln Gly Ser Asp Pro Tyr Arg
200 205 210

Gln Leu Arg Glu Lys Asp
215







<210> 58 

<211> 292 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 1810320 

<400> 58 



Met Ala Gln Pro Pro Pro Asp Val Glu Gly Asp Asp Cys Leu Pro
1 5 10 15

Ala Tyr Arg His Leu Phe Cys Pro Asp Leu Leu Arg Asp Lys Val
20 25 30

Ala Phe Ile Thr Gly Gly Gly Ser Gly Ile Gly Phe Arg Ile Ala
35 40 45

Glu Ile Phe Met Arg His Gly Cys His Thr Val Ile Ala Ser Arg
50 55 60

Ser Leu Pro Arg Val Leu Thr Ala Ala Arg Lys Leu Ala Gly Ala
65 70 75

Thr Gly Arg Arg Cys Leu Pro Leu Ser Met Asp Val Arg Ala Pro
80 85 90

Pro Ala Val Met Ala Ala Val Asp Gln Ala Leu Lys Glu Phe Gly
95 100 105

Arg Ile Asp Ile Leu Ile Asn Cys Ala Ala Gly Asn Phe Leu Cys
110 115 120

Pro Ala Gly Ala Leu Ser Phe Asn Ala Phe Lys Thr Val Met Asp
125 130 135

Ile Asp Thr Ser Gly Thr Phe Asn Val Ser Arg Val Leu Tyr Glu
140 145 150

Lys Phe Phe Arg Asp His Gly Gly Val Ile Val Asn Ile Thr Ala
155 160 165

Thr Leu Gly Asn Arg Gly Gln Ala Leu Gln Val His Ala Gly Ser
170 175 180

Ala Lys Ala Ala Val Asp Ala Met Thr Arg His Leu Ala Val Glu
185 190 195

Trp Gly Pro Gln Asn Ile Arg Val Asn Ser Leu Ala Pro Gly Pro
200 205 210

Ile Ser Gly Thr Glu Gly Leu Arg Arg Leu Gly Gly Pro Gln Ala
215 220 225

Ser Leu Ser Thr Lys Val Thr Ala Ser Pro Leu Gln Arg Leu Gly
230 235 240

Asn Lys Thr Glu Ile Ala His Ser Val Leu Tyr Leu Ala Ser Pro
245 250 255

Leu Ala Ser Tyr Val Thr Gly Ala Val Leu Val Ala Asp Gly Gly
260 265 270

Ala Trp Leu Thr Phe Pro Asn Gly Val Lys Gly Leu Pro Asp Phe
275 280 285

Ala Ser Phe Ser Ala Lys Leu
290







<210> 59 
<211> 158 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 964996 

<400> 59 



Glu Gly Gly Pro Ser Trp Thr Arg Glu Arg Thr Leu Val Ala Val
1 5 10 15

Lys Pro Asp Gly Val Gln Arg Arg Leu Val Gly Asp Val Ile Gln
20 25 30

Arg Phe Glu Arg Arg Gly Phe Thr Leu Val Gly Met Lys Met Leu
35 40 45

Gln Ala Pro Glu Ser Val Leu Ala Glu His Tyr Gln Asp Leu Arg
50 55 60

Arg Lys Pro Phe Tyr Pro Ala Leu Ile Arg Tyr Met Ser Ser Gly
65 70 75

Pro Val Val Ala Met Val Trp Glu Gly Tyr Asn Val Val Arg Ala
80 85 90

Ser Arg Ala Met Ile Gly His Thr Asp Ser Ala Glu Ala Ala Pro
95 100 105

Gly Thr Ile Arg Gly Tyr Phe Ser Val His Ile Ser Arg Asn Val
110 115 120

Ile His Ala Ser Asp Ser Val Glu Gly Ala Gln Arg Glu Ile Gln
125 130 135

Leu Trp Phe Gln Ser Ser Glu Leu Val Ser Trp Ala Asp Gly Gly
140 145 150

Gln His Ser Ser Ile His Pro Ala
155











<210> 60 
<211> 559 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701884305H1 
<400> 60

ggaaacctaa acgcgcgtgc gcttcttcca cgccacggaa accgtgcagg cctggtgtgg 60
tctccaaagt gactgaacaa tgcagaagga cagtggccca ctggttcctt tacattatta 120
tggtttcggc tatgcggccc tggtggctac tggtgggatt attggctatg caaaagcagg 180
tagtgtgccg tccctggctg ctggactctt ctttgggggc ctggcaggcc tgggtgccta 240
ccagctgtct caggacccca ggaacgtgtg ggttttccta gctacgtctg ggactttggc 300
tggcattatg gggatgagat tctacaactc tgggaaattt atgcctgcag gtttgatcgc 360
gggagccagt ttgctgatgg ttgccaaact tggacttagt atgttgagtt caccccatcc 420
gtagtagcca tagtcctgcg tgggctcatg atgagttgac actctccagt cctccacatt 480
accacgctga agagataaga acagcaaaga cctacactga gcacatggag gcgaagacgt 540
ggttactata gtgaccgtc 559



<210> 61 
<211> 326 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701607951H1 

<400> 61 

gtgttgggtg tgttcttact ttgcggattt taccaccctg gaattgttcc gtacgcgcag 60
gcgcgcgggc gctctcccgt gcactctctg ctgagctagc ggactgcccg cctctctaaa 120
acgtcctgta actgcggttc cgggagtgga aacctaaacg cgcgtgcgct tcttccacgc 180
cacggaaacc gtgcaggcct ggtgtggtct ccaaagtgac tgaacaatgc agaaggacag 240
tggcccactg gttcctttac attattatgg tttcggctat gcggccctgg tggctactgg 300
tgggattatt ggctatgcaa aagcag 326



<210> 62 
<211> 333 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701644253H1

<400> 62

aacgtcctgt aactgcggtt ccgggagtgg aaacctaaac gcgcgtgcgc tttcttccac 60
gccacggaaa accgtgcagg cctngtgtgg tctccanagt gactgaacaa tgcagaagga 120
cagtggccca ctggntcctt tacattatta tggtttcggc tatgcggccc tggtggctac 180
tggtgggatt attggctatg caaaagcagg tagtgtgccg tccctggctg ctggactctt 240
ctttgggggc ctggcaggcc tgggtgccta ccagctgtct caggacccca ggaacgtgtg 300
ggttttccta gctacgnctg ggactttggc tgg 333



<210> 63 
<211> 318 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701513151H1 

<400> 63 

cttactttgc ggattttacc accctggaat tgttccgtac gcgcangngc gcggggctct 60
cccgtgcact ctctgctgag ctagcggact gcccgcctct ctaaaacgtc ctgtaactgc 120
ggttccggga gtggaaacct aaacgcgcgt gcgcttcttc cacgccacgg aaaccgtgca 180
ggcctggtgt ggtctccaaa gtgactgaac aatgcagaag gacagtggcc cactggttcc 240
tttacattat tatggtttcg gctatgcggc cctggtggct actggtggga ttattggcta 300
tgcaaaagca ggtagtgt 318



<210> 64 
<211> 315 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701652337H1 

<400> 64 

cagcncaggc ctccgggctc cagctccggt gttgggtnca ggcctggtgt ggtctccaaa 60 

gtgactgaac aatgcagaag gacagtggcc cactggttcc tttacattat tatggtttcg 120 

gctatgcggc cctggtggct actggtggga ttattggcta tgcaaaagca ggtagtgtgc 180 

cgtccctggc tgctggactc ttctttgggg gcctggcagg cctgggtgcc taccagctgt 240 

ctcaggaccc caggaacgtg tgggttttcc tagctacgtc tgggactttg gctggcatat 300

ggggatgaga ttcta 315 



<210> 65 
<211> 313 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701562183H1 

<400> 65 

ggtctccaaa gtgactgaac aatgcagaag gacagtggcc cactggttcc tttacattat 60
tatggtttcg gctatgcggc cctggtggct actggtggga ttattggcta tgcaaaagca 120
ggtagtgtgc cgtccctggc tgctggactc ttctttgggg gcctggcagg cctgggtgcc 180
taccagctgt ctcaggaccc caggaacgtg tgggttttcc tagctacgtc tgggactttg 240
gctggcatta tggggatgag attctacaac tctgggaaat ttatgcctgc aggtttgatc 300
gcgggancat ttt 313



<210> 66 
<211> 304 
<212> DNA 

<213> Rattus norvegicus 



<220>

<221> misc_feature

<223> Incyte ID No.: 700227356H1

<400> 66

cgccgtcgtc ctccagcgca ggcctccggg ctccagctcc ggtgttgggt gcaggcctgg 60
tgtggtctcc aaagtgactg aacaatgcag aaggacagtg gcccactggt tcctttacat 120
tattatggtt tcggctatgc ggccctggtg gctactggtg ggattattgg ctatgcaaaa 180
gcaggtagtg tgccgtccct ggctgctgga ctcttctttg ggggcctggc aggcctgggt 240
gcctaccagc tgtctcagga ccccaggaac gtgtgggttt tcctagctac gtctgggact 300
ttgg 304



<210> 67 
<211> 327 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701649802H1 
<400> 67

ctccggtgtt gggtgcaggc ctggtgtggt ctccaaagtg actgaacaat gcagaaggac 60 
agtggaccac tggttcctta cattattatg gtttcggcta tgcggccctg gtggctactg 120 
gtgggattat tgnctttgca aaagcaggta gtgtgccgtc cctggctgtt ggactcttct 180 
ttgggggcct ggcaggcctg ggtgcctacc agctgtctca ggaccccagg aacgtgtggg 240 
ttttcctagc tacgtctggg actttggctg gcattatggg gatgagattc tacaactctg 300 
ggaaatttat gcctgcagtt tgatcgc 327 



<210> 68 
<211> 305 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700226414H1 

<400> 68 

gccgtcgtcc tccagcncag gcctccgggc tccagctccg gtgttgggtg caggcctggt 60
gtggtctcca aagtgactga acaatgcaga aggacagtgg cccactggtt cctttacatt 120
attatggttt cggctatgcg gccctggtgg ctactggtgg gattattggc tatgcaaaag 180
caggtagtgt gccgtccctg gctgctggac tcttctttgg gggcctggca ggcctgggtg 240
cctaccagct gtctcaggac cccaggaagt gtgggttttc ctagctacgt ctgggacttg 300
gctgg 305



<210> 69 
<211> 295 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700275094H1 

<400> 69 

tcctccagcn caggcntccg ggctccagct ccggtgttgg gtgcaggcct ggtgtggtct 60
ccaaagtgac tgaacaatgc agaaggacag tggcccactg gttcctttac attattatgg 120
tttcggctat gcggccctgg tggctactgg tgggattatt ggctatgcaa aagcaggtag 180
tgtgccgtcc ctggctgctg gactcttctt tggggggcct ggcaggcctg ggtgcctacc 240
agctgtctca ggaccccagg aacgtgtggg ttttcctagc tacgtctggg atttg 295



<210> 70 
<211> 301 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700226425H1 

<400> 70 

cctgacctct gttcctgtgc tcccgccgtc gtcctccagc gcaggcctcc gggctccagc 60
tccggtgttg ggtgcaggcc tggtgtggtc tccaaagtga ctgaacaatg cagaaggaca 120
gtggcccact ggttccttta cattattatg gtttcggcta tgcggccctg gtggctactg 180
gtgggattat tggctatgca aaagcaggta gtgtgccgtc cctggctgct ggactcttct 240
ttgggggcct ggcaggcctg ggtgcctacc agctgtctca ggaccccagg aacgtgtggg 300
t 301



<210> 71 
<211> 282 
<212> DNA

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700275207H1 

<400> 71 

tcctccagcg caggcctccg ggctccagct ccggtgttgg gtgcaggcct ggtgtggtct 60
ccaaagtgac tgaacaatgc agaaggacag tggcccactg gttcctttac attattatgg 120
tttcggctat gcggccctgg tggctactgg tgggattatt ggctatgcaa aagcaggtag 180
tgtgccgtcc ctggctgctg gactcttctt tgggggcctg gcaggcctgg gtgcctacca 240
gctgtctcag gaccccagga acgtgtgggt tttcctagct ac 282



<210> 72 
<211> 282 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701507568H1

<400> 72

cgccgtcgtc ctccagcgca ggcctccggg ctccagctcc ggtgttgggt gcaggcctgg 60
tgtggtctcc aaagtgactg aacaatgcag aaggacagtg gcccactggt tcctttacat 120
tattatggtt tcggctatgc ggccctggtg gctactggtg ggattattgg ctatgcaaaa 180
gcaggtagtg tgccgtccct ggctgctgga ctcttctttg ggggcctggc aggcctgggt 240
gcctaccagc tgtctcagga ccccaggaac gtgtgggttt tc 282



<210> 73 
<211> 281 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 700300118H1

<400> 73

cgccgtcgtc ctccagcgca ggcctccggg ctccagctcc ggtgttgggt gcaggcctgg 60
tgtggtctcc aaagtgactg aacaatgcag aaggacagtg gcccactggt tcctttacat 120
tattatggtt tcggctatgc ggccctggtg gctactggtg ggattattgg ctatgcaaaa 180
gcaggtagtg tgccgtccct ggctgctgga ctcttctttg ggggcctggc aggcctgggt 240
gcctaccagc tgtctcagga ccccaggaac gtgtgggttt t 281



<210> 74 
<211> 292 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700301710H1 

<400> 74 

cctgnacctc tgttcctgtg ctcccgccgt cgtcctccag cgcaggcctc cgggctccag 60
ctccggtgtt gggtgcaggc ctggtgtggt ctccaaagtg actgaacaat gcagaaggac 120
agtggcccac tggttccttt acattattat ggtttcggct atgcggccct ggtggctact 180
ggtgggatta ttggctatgc aaaagcaggt agtgtgccgt ccctggctgc tggactcttc 240
tttgggggcc tggcaggcct gggtgcctac cagctgtctc aggaccccag ga 292
<210> 75
<211> 289 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700064344H1 

<400> 75 

cagcgcaggc ctccgggctc cagctccggt gttgggtgtg ttcttacttt gcggatttta 60
ccaccctgga attgttccgt acgcgcaggc gcgcgggcgc tctcccgtgc actctctgct 120
gagctagcgg actgcccgcc tctctaaaac gtcctgtaac tgcggttccg ggagtggaaa 180
cctaaacgcg cgtgcgcttc ttccacgcca cggaaaccgt gcaggcctgg tgtggtctcc 240
aaagtgatga acatgcagaa ggacantggc ccactggttc ttanatatt 289



<210> 76 
<211> 276 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701423273H1 

<400> 76 

agcgcaggcc tcagggctcc agctccggtg ttgggtgcag gcctggtgtn gtctccaaag 60 
tgactgaaca atgcagaagg acagtggccc actggttcct ttacattatt atggtttcgg 120 
ctatgcggcc ctggtggcta ctggtgggat tattggctat gcaaaagcag gtagtgtgcc 180 
gtccctggct gctggactct tctttggggg cctggcaggc ctgggtgcct accagctgtc 240 
tcaggacccc aggaacgtgt gggttttcct agctac 276 



<210> 77 
<211> 293 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700225847H1 

<400> 77 

ccgtcgtcct ccagcncagg cctccgggct ccagctccgg tgttgggtgc aggcctggtg 60 

tggtctccaa agtgactgaa caatgcagaa ggacagtggc ccactggttc ctttacatta 120 

ttatggtttc ggctatgcgg ccctggtggc tactggtggg attattggct atgcaaaagc 180 

aggtagtgtg ccgtccctgg ctgctggact ctctttgggg gcctggcang cctgggtgcc 240 

taccagctgt ctcaggaccc cagaacgtgt gggtttccta gctacgtctg gga 293 



<210> 78 
<211> 274 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701462776H1 

<400> 78 

tgctcccgcc gtcgtcctcc agcgcaggcc tccgggctcc agctccggtg ttgggtgcag 60 
gcctggtgtg gtctccaaag tgactgaaca atgcagaagg acagtggcnc actggttcct 120 
ttacattatt atggtttcgg ctatgcggcc ctggtggcta ctggtgggat tattggctat 180
gcaaaagcag gtagtgtgcc gtccctggct gctggactct tctttggggg cctggcaggc 240 
ctgggtgcct accagctgtc tcaggacccc agga 274 



<210> 79 
<211> 282 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700916803H1 

<400> 79 

gtgctcccgc cgtcgtcctc cagcgcaggc ctccgggctc cagctnccgg tgttgggtgt 60
gttcttactt tgcggatttt accaccctgg aattgttccg tacgcgcagg cgcgcggggc 120
tctcccgtgc actctctgct gagctagcgg actgcccgcc tctctaaaac gtcctgtaac 180
tgcggttccg ggagtggaaa cctaaacgcg cgtgcgcttc ttccacgcca cggaaaccgt 240
gcaggcctgg tgtggtctcc aaagtgactg aacaatgcag aa 282



<210> 80 
<211> 280 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700478141H1 

<400> 80 

gccgtcgtcc tccagcgcag gcctccgggc tccagctccg gtgttgggtg caggcctggt 60
gtggtctcca aagtgatgaa caatgcagaa ggacagtggc ccactggttc ctttacatta 120
ttatggtttc ggctatgcgg ccctggtggc tactggtggg attattggct atgcaaaagc 180
aggtagtgtg ccgtccctgg ctgctggact cttctttggg ggcctggcag gcctgggtgc 240
ctaccagctg tctcaggacc ccaggaacgt gtgggttttc 280



<210> 81 
<211> 299 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701646690H1 

<400> 81 

tncctccngg ctccagctcc ggtgttgggt gcaggcctgg tgtggtctcc aaagtnactg 60
aacaatgcan aangacagtn gcccactggt tcctttacnt tattatggtt tcngntatgc 120
ngccctggtg gctactggtg ggattattgg ctatgcaaaa ncaggtagtg tgccgtccct 180
ggctgntgga ntcttctttg ggggcctggc aggcctgggt gcctaccagc tgtctcagga 240
ccccaggaac gtgtgggttt tcctagctac gtctggnact ttggctggca tatggggat 299



<210> 82 
<211> 286 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701624261H1
<400> 82

tctcctccac aggtgcangc ctggtgtggt ctccaaagtg actgnncaat gcagaaggac 60 

agtggcccac tggttccttt acattattat ggtttcggct atgcggccct ggtggctact 120 

ggtgggatta ttggctatgc aaaagcaggt agtgtgccgt ccctggctgc nngactcttc 180 

tttgggggcc tggcaggcct gggtgcctac cagctgtctc aggaccccag gaacgtgtgg 240 

gttttcctag ctacgtctgg gactttggct ggcattatgg ggatga 286 



<210> 83 
<211> 266 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700912920H1 

<400> 83 

gcagaaggac agtggcccac tggttccttt acattattat ggtttcggct atgcggccct 60 

ggtggctact ggtgggatta ttggctatgc aaaagcaggt agtgtgccgt ccctggctgc 120 

tggactcttc tttgggggcc tggcaggcct gggtgcctac cagctgtctc aggaccccag 180 

gaacgtgtgg gttttcctag ctacgtctgg gactttggct ggcattatgg ggatgagatt 240 

ctacaactct gggaaattta tgcctg 266 



<210> 84 
<211> 262 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701482566H1 

<400> 84 

ctggctgctg gactcttctt tgggggcctg gcaggcctgg gtgcctacca gctgtctcag 60
gaccccagga acgtgtgggt tttcctagct acgtctggga ctttggctgg cattatgggg 120
atgagattct acaactctgg gaaatttatg cctgcaggtt tgatcgcggg agccagtttg 180
ctgatggttg ccaaacttgg acttagtatg ttgagttcac cccatccgta gtagccatag 240
ccctgcgtgg gctcatgatg ag 262



<210> 85 
<211> 285 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700270272H1 

<400> 85 

ctgttcctgt gctcccgccg tcgtcctcca gcncaggcct ccgggctcca gctccggtgt 60
tgggtgcagg cntgntgtgg tctccaaagt gactgaacaa tgcagaagga cagtggccca 120
ctggttcctt tacattatta tggtttcggc tatgcggccc tggtggctac tggtgggatt 180
attggctatg caaaagcagg tagtgtgccg tccctggcct gctggactct tctttggggg 240
cctggcaggc ctgggtgcct accagctgtc tcaggacccc aggaa 285



<210> 86 
<211> 268 
<212> DNA 
<213> Rattus norvegicus
<220>

<221> misc_feature

<223> Incyte ID No.: 700628520H1

<400> 86

ctccagcnca ggcctccggg ctccagctcc ggtgttgggt gcaggcctgg tgtggtctcc 60
aaagtgactg aacaatgcag aaggacagtg gcccactggt tcctttacat tattatggtt 120
tcggctatgc ggccctggtg gctactggtg ggattattgg ctatgcaaaa gcaggtagtg 180
tgccgtccct ggctgctgga ctcttctttg ggggcctggc aggcctgggt gcctaccagc 240
tgtctcagga ccccaggaac gtgtgggt 268



<210> 87 
<211> 269 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700534975H1 

<400> 87 

tgctcccgcc gtcgtcctcc agcgcaggcc tccgggctcc agctccggtg ttgggtgcag 60
gcctggtgtg gtctccaaag tgactgaaca atgcagaagg acagtggctc actggttcct 120
ttacattant atggtttcgg ctatgcggcc ctggtggcta ctggtgggat tattggctat 180
gcaaaagcag gtagtgtgcc gtccctggct gctggactct tctttggggg cctggcaggc 240
ctgggtgcct accagctgtc tcaggaccc 269



<210> 88 
<211> 262 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 700176004H1

<400> 88

tatgcngccc tggtggctac tggtgggatt attggctatg canaagcagg tagtgtgccg 60
tccctggctg ctggactctt ctttgggggc ctggcaggcc tgggtgccta ccagctgtct 120
caggacccca ggaacgtgtg ggttttccta gctacgtctg ggactttggc tggcattatg 180
gggatgagat tctacaactc tgggaaattt atgcctgcag gtttgatcgc gggagccagt 240
ttgctgatgg ttgccaaact tg 262



<210> 89 
<211> 349 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701609236H1

<400> 89

cgtacgcgca ggcgcgcggg gctctcccgt gcactctctg gctgagcnng cggactgccc 60
gcctctctaa aacgtcctgt aactgcggtt ccgggagtgg aaacctaaac gcgcgtgcgc 120
ttcttccacg ccacggaaac cgtgcaggcc tggtgtggtc tccaaagtga ctgaacaatg 180
cagaaggaca gtggcccact ggttccttta cattattatg gtttcggcta tgcggccctg 240
gtggctactg gtgggatatt ggctatgcaa aagcagtatg tgccgtccct ggctgctgga 300
ctctcttggg ggctngcagc ctggtgctaa caactgtctc agancccag 349
<210> 90
<211> 263 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701473437H1

<400> 90 

agcncaggcc tccgggctcc agctccggtg ttgggtgcag gcctggtgng gtctccaaag 60 

tgactgaaca atgcagaagg acagtggccc actggttcct ttacattatt atggtttcgg 120 

ctatgcggcc ctggtggcta ctggtgggat tattggctat gcaaaagcag gtagtgtgcc 180 

gtccctggct gctggactct tctttggggg cctggcaggc ctgggtgcct accagctgtc 240 

tcaggacccc aggaacgtgt ggg 263 



<210> 91 
<211> 303 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701606089H1 

<400> 91 

gcgcaggcct ccggggctcc agctccggtg ttgggtgcag gcctggtgtg gtctccaaag 60 

tgactgaaca atgcagaagg acgttngccc actggntcct ttacattatt atggtttcgg 120 

ctatgcggcc ctggtggcta ctggtgggan tattggctat gcaaaagcag gtagtgtgcc 180 

gtccctngct gctggactct tctttngggg cctgncangc ctgggtgcct accagctgtc 240 

tcangacccc aggaacgtgt gggttttccn agctacgtct gggatttgnc tggcatatng 300 



<210> 92 
<211> 273 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701736525H2 

<400> 92 

taactgctcc gacctctcct ccacaggtgc aggcctggtg tggtctccaa agtgactgaa 60 

caatgcagaa ggacagtggc ccactggttc ctttacatta ttatggtttc ggctatgcgg 120 

ccctggtggc tactggtggg attattggct atgcaaaagc aggtagtgtg ccgtccctgg 180 

ctgctggact cttctttggg ggcctggcag gcctgggtgc ctaccagctg tctcaggacc 240 

ccaggaacgt gtgggttttc ctagctacgt ctg 273 



<210> 93 
<211> 262 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701532848H1 

<400> 93 

cngccgtcnt cctccagcgc angcntccgg gctccagctc cggtgttggg tgcaggcctg 60 
gtgtggtctc caaagtgact gaacaatgca gaaggacagt ggcncactgg ttcctttaca 120 
ttattatggt ttcggctatg cggccctggt ggctactggt gggattattg gctatgcaaa 180
agcaggtagt gtgccgtccc tggctgctgg actcttcttt gggggcctgg caggcctggg 240 
tgcctaccag ctgtctcagg ac 262 



<210> 94 
<211> 247 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700181220H1 

<400> 94 

aaaacgtcct gtaactgcgg ttccgggagt ggaaacctaa acgcgcgtgc gcttcttcca 60 

cgccacggaa accgtgcagg cctggtgtgg tctccaaagt gactgaacaa tgcagaagga 120 

cagtggccca ctggttcctt tacattatta tggtttcggc tatgcggccc tggtggctac 180 

tggtgggatt attggctatg caaaagcagg tagtgtgccg tccctggctg ctggactctt 240 

ctttggg 247 



<210> 95 
<211> 284 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701462707H1 

<400> 95 

tacacacccg gctcctgacc tctgttcctg tgctcccgcc gtcgtcctcc agcgcaggcc 60 

tccgggctcc agctccggtg ttgggtgcan gcctggtgtg gtctccaaag tgactgaaca 120 

atgcagaagg acagtggccc actggttcct ttacattatt atggtttcgg ctatgcggcc 180 

ctggtggcta ctggtgggat tattggctat gcaaaagcag gtagtgtgcc gtccctggct 240 

gctggactct tctttggggg cctggcaggc ctgggtgcct acca 284 



<210> 96 
<211> 282 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701462863H1 

<400> 96 

tacacacccg gctcctgacc tctgttcctg tgctcccgcc gtcgtcctcc agcgcaggcc 60 

tccgggctcc agctccggtg ttgggtgcag gcctggtgtg gtctccaaag tgactgaaca 120 

atgcagaagg acagtggccc actggttcct ttacattatt atggtttcgg ctatgcggcc 180 

ctggtggcta ctggtgggat tattggctat gcaaaagcag gtagtgtgcc gtccctggct 240 

gctggactct tctttggggg cctggcaggc ctgggtgcct ac 282 



<210> 97 
<211> 281 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701481465H1 
<400> 97

ttcttaactg ctccgacctc tcctccacag gtgcaggcct ggtgtggtct ccaaagtgac 60
tgaacaatgc agaaggacag tggcccactg gttcctttac attattatgg tttcggctat 120
gcggccctgg tggctactgg tgggattatt ggctatgcaa aagcaggtag tgtgccgtcc 180
ctgggctgct ggactcttct ttgggggcct ggcaggcctg ggtgcctacc agctgtctca 240
ggaccccagg aacgtgtggg ttttcctagc tacgtctggg a 281



<210> 98 
<211> 265 
<212> DNA 
<213> Rattus norvegicus
<220>

<221> misc_feature

<223> Incyte ID No.: 701308467H1 

<400> 98 



tgttcctgtg ctcccgccgt cgtcctccag cgcaggcctc cgggctccag ctccggngtt 60 

gggtgcaggc ctggtgtggt ctccaaagtg actgaacaat gcagaaggac agtggcccac 120 

tggttccttt acattattat ggtttcggct atgcggccct ggtggctact ggtgggatta 180 

ttggctatgc aaaagcaggt agtgtgccgt ccctggctgc tggactcttc tttgggggcc 240 

tgnagnctgg gtgcctacca gctgt 265 



<210> 99 
<211> 291 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701564368H1



<400> 99 



gggggcctgg caggcctggg tgcctaccag ctgtctcagg accccaggaa cgtgtgggtt 60 

ttcctagcta cgtctgggac tttggctggc attatgggga tgagattcta caactctggg 120 

aaatttatgc ctgcaggttt gatcgcggga gccagtttgc tgatggttgc caaacttgga 180 

cttagtatgt tgagttcacc ccatccgtag tagccatagt cctgcgtggg ctcatgatga 240 

gttgacactc tccagtcctc cacattacca cgctgaagag ataagaacag c 291 



<210> 100 
<211> 271 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_feature

<223> Incyte ID No.: 700533180H1

<400> 100

caggtagtgt gccgtccctg gctgctggac tcttctttgg gggcctggca ggcctgggtg 60
gcctaccagc tgtcctcagg aaccccagga acgtgtgggt tttcctagct acgtctggga 120
ctttggctgg cattatgggg atgagattct acaactctgg gaaatttatg cctgcaggtt 180
tgatcgcggg agccagtttg ctgatggttg ccaaacttgg acttagtatg ttgagttcac 240
cccatccgta gtagccatag tcctgcgtgg a 271



<210> 101 
<211> 255 
<212> DNA 
<213> Rattus norvegicus
<220>

<221> misc_feature

<223> Incyte ID No.: 700124647H1

<400> 101

ccgtcgtcct ccagcncagg cctccgggct ccagctccgg tgttgggtgc aggcctggtg 60
tggtctccaa agtgactgaa caatgcagaa ngacagtggc ccactggttc ctttacatta 120
ttatggtttc ggctatgcgg ccctggtggc tactggtggg attattggct atgcaaaagc 180
aggtagtgtg ccgtccctgg ctgctggatc ttctttgggg gcctggcagg cctgggtgcc 240
tannagctgt ctcaa 255



<210> 102 
<211> 297 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 700537020H1

<400> 102

gccctggtgg ctactggtgg gattattggc tatgcaaaag caggtagtgt gccgtccctg 60
gctgcnggac tcttctttgg gggcctggca ggcctgggtg cctacnagct aggctcagga 120
ccccaggaac gtgtgggttt tcctagctac tctggaccnt nggctggcat tatggggatg 180
agattctaca actctgggaa atttatgcct gcaggtttga tcgcgggagc cagtttgctg 240
atggttgcca aacttggact tagtatgttg agttcacccc atccgtagta gccatag 297



<210> 103 
<211> 261 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700765205H1 

<400> 103 

gacctctgtt cctgtgctcc cgccgtcgtc ctccagcgca ggcctccggg ctccagctcc 60
ggtgttgggt gcaggcctgg tgtggtctcc aaagtgactg aacaatgcag aaggacagtg 120
gcccactggt tcctttacat tattatggtt tcggctatgc ggccctggtg gctactggtg 180
ggattattgg ctatgcaaaa gcaggtagtg tgccgtccct ggctgctgga ctcttctttg 240
ggggcctggc aggctgggtg c 261



<210> 104 
<211> 312 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701942992H1

<400> 104

cgacgtctac ncacccggct cctgacctct gttcctgtgc tcccgccgtc gtcctccagc 60
gcaggcctcc gggctccagc tccggtgttg ggtgcaggcc tggtgtggtc tccaaagtna 120
ctgaacaatg cagaaggaca gtggcccact ggttccttta cattattatg gtttcggcta 180
tgcggccctg gtggctactg gtgggattat tggctatgca aaagcaggta gtgtgccgtc 240
cctggctgct ggactcttct ttgggggcct ggcagcctgg ggcctacaag ttttntcagg 300
ncccaggnan nt 312
<210> 105
<211> 241 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701197694H1 

<400> 105 

tgctcccgcc gtcgtcctcc agcgcaggcc tccgggctcc agctccggtg ttgggtgcag 60
gcctggtgtg gtctccaaag tgactgaaca atgcagaagg acagtggccc actggttcct 120
ttacattatt atggtttcgg ctatgcggcc ctggtggcta ctggtgggat tattggctat 180
gcaaaagcag gaacgtgtgg gttttcctag ctacgtctgg gactttggct ggnattatgg 240
g 241



<210> 106 
<211> 268 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701024952H1 

<400> 106 

cccggctcct gacctctgtt cctgtgctcc cgccgtcgtc ctccagcgca ggcctccggg 60 
ctccagctcc ggtgttgggt gcaggcctgg tgtggtctcc aaagtgactg nacaatgcag 120 
aaggncagtg gcccactggt tcctttacat tattatggtt tcggctatgc ggccctggtg 180 
gctactggtg ggattattgg ctatgcaaaa gcaggtagtg tgccgtccct ggctgctgga 240 
ctctnctttn ggggcctggc aggcttag 268 



<210> 107 
<211> 318 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701582676H1 

<400> 107 

gcctaccagc tgtctcagga ccccaggaac gtgtgggttt tcctagctac gtctgggact 60 

ttggctggca ttatggggat gagattctac aactctggga aatttatgcc tgcaggtttg 120 

atcgcgggag ccagtttgct gatggttgcc aaacntggac ttagtatgtt gagttcaccc 180 

catccgtagt agccatagtc ctgcgtgggc tcatgatgag ttgacactct ccagtcctcc 240 

acattaccac gctgaagaga taagaacagc aaagacctac actgagcaca tggaggcgaa 300 

gacgtggtta ctatagtg 318 



<210> 108 
<211> 255 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701293154H1 

<400> 108 

ggattattgg ctattgcaaa agcaggtaag tgtgccgtcc ctggctgctg gactcttctt 60 
tgggggcctg gcaggcctgg gtgcctacca gctgtctcag gaccccagga acgtgtgggt 120 
tttcctagct acgtctggga ctttggcttg cattatgggg atgagattct acaactctgg 180
gaaatttatg cctgcaggtt tgatcgcggg agccagtttg ctgatggttg ccaaacttgg 240 
attagtatgt tgagg 255 



<210> 109 
<211> 254 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701298824H1 

<400> 109 

catgcgcagg cctccgggct ccatgctccg gtgttgggtg catggcctgg tgnggtctcc 60
aaagngactg aacaatgcag aaggacagtg gcccactggt tcctttacat tattatggnt 120
tcggctatgc ggccctggtg gctactggtg ggattattgg ctatgcaaaa gcnggtagtg 180
tgccgccctg gctgctggac tcttctttgg gggcctgcag nctgggtgcc taccagctgt 240
ctcaggaccc agga 254



<210> 110 
<211> 294 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700524204H1 

<400> 110 

tcaggacccc aggaacgtgt gggttttcct agctacgtct gggactttgg ctggcattat 60 
ggggatgaga ttctacaact ctgggaaatt tatgcctgca ggtttgatcg cgggagccag 120 
tttgctgatg gttgccaaac ttggacttag tatgttgagt tcaccccatc cgtagtagcc 180 
atagccctgc gtgggctcat gatgagttga cactctccag tcctctacat taccacgctg 240 
aagagataag aacagcaaag acctacactg agcacatgga ggcgaagagt ggtt 294 



<210> 111 
<211> 289 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 700067537H1 

<400> 111 

gacgtctaca cacccggctc ctgacctctg ttcctgtgct cccgccgtcg tcctccagcg 60 
caggcctccg ggctccagct ccgctgttgg gtgcaggcct ggtgtggtct ccaaagtgac 120 
tgaacaatgc agaaggacag tggcccactg gttcctttac attattatgg tttcggctat 180 
gcggccctgg tggctactgg tgggattatt ggctatgcaa aagcagtagt gtgccgtccc 240 
tggctgctgg atcttctttg ggggctggca ggctgggtgc ctacaactg 289 



<210> 112 
<211> 276 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701258019H1 
<400> 112

tgttcctgtg ctcccgccgt cgtcctccag cgcaggcctc cgggctccag ctccggtgtt 60
gggtgcaggc ctggtgtggt ctccaaagtg actgaacaat gcatgaagga cagttggccc 120
actggttcct ttacattatt atggnttccg gctatgcggc cctggtggct actggtgnga 180
ttattggcta tgcaaaagca ggtagtgtgc cgccctggct gctggactct tctttggggg 240
cctgcagnct ggtgcctacc agctgctctg cgtngg 276



<210> 113 
<211> 254 
<212> DNA 
<213> Rattus norvegicus
<220>

<221> misc_feature

<223> Incyte ID No.: 700532493H1

<400> 113

tcangacccc aggaacgtgt gggttttcct agctacgtct gggactttgg cnggcattat 60
ggggctgaga ttctacaact ctgggaaatt tatgcctgca ggtttgatcg cgggagccag 120
tttgctgatg gttgccaaac ttggacttag tatgttgagt tcaccccatc cgtagtagcc 180
atagccctgc gtgggctcat gatgagttgc atctccagtc ctctacatta ccacgctgaa 240
gagatanaac agca 254



<210> 114 
<211> 282 
<212> DNA 
<213> Rattus norvegicus
<220>

<221> misc_feature

<223> Incyte ID No.: 700523302H1

<400> 114

ctccagcnca ggcctccggg ctccagctcc ggtgttgggt gcaggcctgg tgtggtctcc 60
aaagtgactg aacaatgcag aaggacagtg gcccactggt tcctttanat aatnatggtc 120
gggtanangn ncccgnnnng nnaagggggn atnttgnnnt acgnaagagc ngntagtgtg 180
ccgtccctgg ctgctggact cttctttggg ggcctggcag gcctgggtgc ctaccagctg 240
tctcaggacc ccaggaacgg tgggtttccn agctacgncg gg 282



<210> 115 
<211> 256 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701242719H1

<400> 115

cacacccggc tcctgacctc tgttcctgtg ctcccgccgn cgncctccag cgcaggcctc 60
cgggctccag ctccgntgtt gggtgcaggc ctggtgtggt ctccaaagtg actgaacaat 120
gcagaaggac agtggcccac tggttccttt acattattat ggtttcggct atgcggccct 180
ggnggctact ggtgggatta ttggctatca aaagcaggta gtgtgccgcc ctggctgtgg 240
actcttcttt ggggcc 256



<210> 116 
<211> 244 
<212> DNA 
<213> Rattus norvegicus
<220>

<221> misc_feature

<223> Incyte ID No.: 701226025H1

<400> 116

cattattatg gtttcggcta tgcggccctg gtggctactg gtgggattat tggctatgca 60
aaagcaggta gtgtgccgcc ctggctgctg nctcttcttt ggaggcctgg caggcctggg 120
tgcctaccag ctgctcagga ccccaggaac gtgtgggttt tcctagctac gtctgggact 180
ttgctggcat tatggggatg agattctaca actctgggaa atttatcctg caggtttgat 240
cgcg 244



<210> 117 
<211> 262 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701293276H1 



<400> 117 



cgtctacaca cccggctcct gacctctgtt cctgtgctcc cgcccgtcgt cctccagcgc 60
aggcctccgg gctccagctc cggtgttggg tgcaggcctg gngtggtctc caaagtgact 120
gaacaatgca gaaggacagt ggcccactgg ttcctttaca ttattatggt ttcggctatc 180
ggcccttggt ggctactggt gggattattg gctatgcaaa agcaggtagt gtgccgtccc 240
tggctgtgga ctctctntgn gg 262



<210> 118 
<211> 261 
<212> DNA 
<213> Rattus norvegicus
<220>

<221> misc_feature

<223> Incyte ID No.: 700493358H1

<400> 118

caggcctggt gtggtctcca aagtgactga acaatgcaga aggacagtgg cccactggtt 60
cctttacatt attatggttt cggctatgcg gccctggtgg ctactggtgg gattattggc 120
tatgcaaaag caggtagtgt gccgtccctg gctgctggac tcttctttgg ggncntggca 180
ggcctgggtn canacnantg tctaggnccc caagaaangt gggttnccca aannaggggg 240
ggnnttggnc canaaangga a 261



<210> 119 
<211> 265 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700533285H1 



<400> 119 



ccttgaactc atttcttcct gactgctaga ggcctgtgtg ttcttaactg ctccgacctc 60
tcctccacag gtgcaggcct ggtgtggtct ccaaagtgac tgaacaatgc agaaggacag 120
tggcccactg gttcctttac attattatgg tttcggctat gcggccctgg tggctactgg 180
tgggattatt ggtatgcaaa agcaggtagt gtgccgtccc tggctgctgg actcttcttt 240
gggggcctgg caggcctggg tgcct 265
<210> 120
<211> 247
<212> DNA
<213> Rattus norvegicus



<220> 

<221> misc_feature 

<223> Incyte ID No.: 700920823H1 



<400> 120 



cgtgnacgtc tacacacccg gctcctgacc tctgttcctg tgctcccgcc gtcgtcctcc 60 
agcgcaggcc tcccgggctc cagctccggt gttgggtgca ggcctggtgt ggtctccaaa 120 
gtgactgaac aatgcagaag gacagtggcc cactggttcc tttacattat tatggtttcg 180 
gctatgcggc cctggtggct actggtggga ttattgctat gcaaaagcag gtagtctgcc 240 
gctccct 247 



<210> 121 
<211> 263 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_feature

<223> Incyte ID No.: 700627607H1 



<400> 121 



gacgtctaca cacccggctc ctgacctctg ttcctgtgct cccgccgtcg tcctccagcg 60 

caggcctccg ggctccagct ccggtgttgg gtgcaggcct ggtgtggtct ccaaagtgac 120 

tgaacaatgc agaaggacag tggcccactg gttcctttac attattatgg tttcggctat 180 

gcggccctgg tggctactgg tgggattatt ggctatgcaa anccagntat cgccggcncn 240 

ggcnanctcg nnccgaggng nnc 263 



<210> 122 
<211> 265 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_feature

<223> Incyte ID No.: 700437944H2 

<400> 122 



ctccgntgtt gggtgcaggc ctggtgtant ctccaaagtg actgaacaat gaagcaggac 60
cantggccca ctggttcctt tacattattn tngtttcggc tatncggccc tgntngctac 120
tgntgggatt attggctatn caaaagcagg tagtgtnccg tccctggctg ctggactctt 180
ctttgggggc ctgacaggct gggtgcctac cagctgtctc angcacccca ggaacgtgtg 240
ngttttccta agctacntct gggac 265



<210> 123 
<211> 343 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 701582848H1

<400> 123 

gctaccagct gtctcaggac ccaggaacgt gtgggtttcc tagctacgtc tgggactttg 60 

gctggcatta tggggatgag attctacnac tctgggaaat ttatgcctgc aggtttgatc 120 

gcgggagcca nttgctgata gttgccaact tngacttagt atgttgagtn caccccatcc 180 
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gtagtagcat ancctgcgtg ggctcagatg agtnacactc tccaggcctc cacatttacc 240 
aggctgaaga gtaagacagc aaagactaca tgagcacntg aggnaaacgt ggttntatat 300 
gacgttcaag acgcgatgnt gactcagact ncntgctcat cgg 343 



<210> 124 
<211> 241 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701305531H1 



<400> 124 



gacgtctaca cacncggctc ctgacctctg ttcctgngct cccgccgncg acctccagcg 60
caggcctccg ggctccagct ccggagttgg gtgcaggcct ggngtgnnct ccaaagtgac 120
tgaacaatgc agaaggacag tggcccactg gttcctttac attattatgg attcggctat 180
gcggccctgg tggctactgg tggattattg gctatcaaaa gcaggagtgt ccgccctgct 240
g 241



<210> 125 
<211> 155 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 700916103H1 

<400> 125 



gtgctcccgc cgtcgtcctc cagcgcaggc ctccgggctc cagctccggt gttgggtgca 60 
ggcctggtgt ggtctccaaa gtgactgaac aatgcagaag gacagtggcc cactggttcc 120 
tttacattat tatggtttcg gctatgcggc cctgg 155 



<210> 126 
<211> 185 
<212> DNA 

<213> Rattus norvegicus 
<220>

<221> misc_feature

<223> Incyte ID No.: 701294764H1 

<400> 126 

ccgccgtcgt ccttcagcgc aaggnctccg ggctccagct ccggagttgg gngcaggcct 60
ggagtggnct ccaaagtgac tgaacaatgc agaaggacan tggcccactg gntcctttac 120
attattatgg tttcggctat gcggccctgg aggcnactgg gggnatattg gctatncaaa 180
agcgg 185



<210> 127 
<211> 125 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 700066710H1 

<400> 127 

ctcttctttg ggggcctgnc caggctgggt gcctaccagc tgtctcagga ccccaggaac 60 
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gtgtgggttt tcctagctac gtctgggact ttggctggca ttatggggat gagattctac 120 
aactc 125 



<210> 128 
<211> 266 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701471559H1 

<400> 128 

tttatgcctg cnggtttgat cgcgggagcc agtttgctga tggttgccaa acttggactt 60 

agtatgttga gttcacccca tccgtagtag ccatagccct gcgtgggctc atgatgagtt 120 

gacactctcc agtcctctac attaccacgc tgaagagata agaacagcaa agacctacac 180 

tgagcacatg gaggcgaaga cgtggttact atagtgaccg ttcagagntg gcgagtgtct 240 

gacctcagag ctcacactgc cttcat 266 



<210> 129 
<211> 208 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700325006H1 

<400> 129 

ggcaggcctg ggtgcctacc agctgtctca ggacnccagg nacgtgtggg ntttcctaga 60
ctacgtctgt gactttggct gancattatt ngggatgana ttctaacaac tctgggaaat 120
ttatgcctgc aggtttnatc gcggncancc agtttgnntg atggttgcca aacttggact 180
tagtangntn anttcacccc ntgccgtc 208



<210> 130 
<211> 263 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701258479H1 

<400> 130 

gcagagctag ggcgagcaag tggctgtgtg ttcaagggcc agttgcatcc gcacccagtg 60 
cttgtacctt gaactcattt cttcctgact gctagaggcc tgtgtgttct taactgctcc 120 
gacctctcct ccacaggtgc aggcctggtg tggnctccaa agtgactgaa caatgcagaa 180 
ggacagtggc ccactggctc ctttacatta ttatggnttc ggctatgcgg cctggtggct 240 
actggnggna ttattggcta tgc 263 



<210> 131 
<211> 258 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature

<223> Incyte ID No.: 700627187H1 
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<400> 131 



aatttatgcc tgcaggttga tcgcnggagc cagtttgctg atggttgcca aacttngact 60 

taggatgttg agttcacccc atcccggagt agccatagtc ctgcgtgggc tcatgatgag 120 

ttgacactct ccagtcctcc acattaccac gctgaagaga taagaacagc aaagacctac 180 

actgagcaca tggaggcgaa gacgtggtta ctatagtgac cgttcagaga cggcgagtgt 240 

ctgactcaga gctcacac 258 



<210> 132 
<211> 272 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701246066H1 



<400> 132 



gcgggagcca gtttgctgat ggntgccaaa cttggactta gnatgttgag ntcaccccnt 60 
ncgtagtagc catagtcctg cgtggtctca tgatgagttg acactctcca gtcctncaca 120 
ttaccacgct gaagagatan gaacagcaaa gacctacact gagcacatgg aggcgaagac 180 
gtggttacta tagtgaccgt tcagagacgg cgagtgtctg acctcagagc tcacactgct 240 
tcatgcggct tgntcttgtg catgatgctc ng 272 



<210> 133 
<211> 253 
<212> DNA 
<213> Rattus norvegicus

<220>

<221> misc_feature

<223> Incyte ID No.: 700594190H1 



<400> 133 



atccgtagta gccatagccc tgcgtgggct catgatgagt tgacactctc cagtcctcta 60 

cattaccacg ctgaagagat aagaacagca aagacctaca ctgagcacat ggaggcgaag 120 

acgtggttac tatagtgacc gttcagagac ggcgagtgtc tgacctcaga gctcacactg 180 

ccttcatgcg gcttgttctt gtgtcatgat gtctcgactc tctgtactac tacataaagg 240 

ggtaaaatgt tgg 253 



<210> 134 
<211> 267 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700627108H1 

<400> 134 



gaattgatnc ctggcaggtt gatcgcggga gccagttttg ctgatggttg acaaactttg 60
gncttngtat ctgagttcaa cccnatcggt agtagccata agtctanccn gggntcatga 120
tgnnttgaac actctccagt cagtccagat naacgncgct gntagagatn aagaccagcn 180
aagacctaca ctgagcacca tggaggcgaa gacgtggtta ctataagtga ccgttcagag 240
acggcgngtg tntggatcan agatcca 267



<210> 135 
<211> 650 
<212> DNA 
<213> Rattus norvegicus
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<220> 

<223> RnAUG.conN 
<400> 135 

gtgctcccgc cgtcgtcctc cagcgcaggc ctccgggctc cagctccggt gttgggtgtg 60 

ttcttacttt gcggatttta ccaccctgga attgttccgt acgcgcaggc gcgcgggcgc 120 

tctcccgtgc actctctgct gagctagcgg actgcccgcc tctctaaaac gtcctgtaac 180 

tgcggttccg ggagtggaaa cctaaacgcg cgtgcgcttc ttccacgcca cggaaaccgt 240 

gcaggcctgg tgtggtctcc aaagtgactg aacaatgcag aaggacagtg gcccactggt 300 

tcctttacat tattatggtt tcggctatgc ggccctggtg gctactggtg ggattattgg 360 

ctatgcaaaa gcaggtagtg tgccgtccct ggctgctgga ctcttctttg ggggcctggc 420 

aggcctgggt gcctaccagc tgtctcagga ccccaggaac gtgtgggttt tcctagctac 480 

gtctgggact ttggctggca ttatggggat gagattctac aactctggga aatttatgcc 540 

tgcaggtttg atcgcgggag ccagtttgct gatggttgcc aaacttggac ttagtatgtt 600 

gagttcaccc catccgtagt agccatagcc ctgcgtgggc tcatgatgag 650 
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<210> 137 
<211> 223 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 746355H1 

<400> 137 

ctacgcagca ctggttgctt ctggtgggat cattggctat gtaaaagcag gcagcgtgcc 60
gtccctggct gcagggctgc tctttggcag tctagccggc ctgggtgctt accagctgtc 120
tcaggatcca aggaacgttt gggttttcct agctacatct ggtaccttgg ctggcattat 180
gggaatgagg ttctaccact ctggaaaatt catgcctgca ggt 223



<210> 138 
<211> 243 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No.: 1294663H1 

<400> 138 

ggaaaattca tgcctgtagg tttaattgca ggtgccagtt tgctgatggc cgccaaagtt 60
ggagttcgta tgttgatgac atctgattag cagaagtcat gttccagctt ggactcatga 120
aggattaaaa atctgcatct tccactattt tcaatgtatt aagagaaata agtgcagcat 180
ttttgcatct gacattttac ctaaaaaaaa aaagacacca aatttggcgg aggggtggaa 240
aat 243
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